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Human genomic sequence 
1 GATCAGACTT TGAAGAGTGT TTGTACCATG CTAAAGTTTA CAGAATTTAT 
51 TCCTGCTCTT TGAGGGTGCA TTGCAAATCC AGGCTAGAGG GAGAGATACC 
101 AGTTAGGAXA GTACAGCAAT ACTCTACTGG GAAATGGTGA GGTGTTTCGT 
151 GAAGACAATG GCAACACAGA TGAAGACATG CAGATGGAGG AAATAAAGAT 
201 CCAGTTGAGC TTGTTGGCCA GTTGGATAGA GGTTGAGGTT ATGCATGATG 
251 GAGCAATCTA GGTTTTTGTC TTGGGTAGGT GTTTCCATGA TAGTACTCAG 
3 01 AATGAATCAT AT AG TTG T AC AGGTTGAATC CCACCCATGT TTGCACAATA 
3 51 GAGTGACTGT CTAGCTGAAA TCCAGATGAC ACTCTGTATG CTAAGCTATG 
401 CTTCATGGAA CTGTATAAAG GCACTTGCTA CATAGGCTAG TGGCAGATCT 
451 GGAAGTAACC TATATGGTAT ATAGGAAATG AGGTGGCTTT TGTATAAATC 
501 CTACAGATAA ATTTCATTTC CTGATCCTAT TATTTTGACT CATGTTAGCC 
551 CAAGAAGAGT ATTCAGTACT TCATATCCCT GAAGGTAAGA CAGAGTAGTA 
601 TTAGATTCAC TATTTGGCAA ATAAAAGGGA TCAAGTCCTA AGATCAAGCT 
651 GATGAATCAA CACCTCATAG GATATGTCCC AACCAATTAT ATGGCTTCCC 
7 01 CTATAAATAA AATCTAGTTC TCTTCTCTGG AGAGGAACAG TGAAGAATAT 
751 CATAACCTAT GCTACAAACT GC TTG AG TAG GAGCTACTTC TCTCCAAGGC 
801 TTTATATCAT TCATTCTGGC AGGCCCCTCT GTTTGTTCTC ACCAGCTCCT 
851 GGGAAATTTA TTTCTCCTCT AG TGATATAA AAGCTCTCTG TTTGAGATGA 
9 01 AGGGCTGCCC AGTTTATCAG ATCTGTATTA GTCTGTTCTC AGGCTGCTAA 
9 51 TAAAGACATA CCTGAGACTG AGTAATTTAT GAAGGAAAGA GGTTTAATTG 
1001 ACTCACAGTT CCACATGGCT GGGGAGGCCT CACAATCATG GCGAAAGACT 
1051 AATAAGGAGC AAAGTCACAT CTTACATGGC TGCAGACAAG AGAGCATGTG 
1101 CAGGGGAACT GCTCTCCATA AAACCATCAG ATCTTGTGAG ACTTGTTCAC 
1151 TATTACAAGA ACAACAGACA GGAAAACCCG CCCCCTCAAT TCAATTACCT 
12 01 GCCACTGGGA CCCTCCCACA ACACATGGGG ATTATGAGAG CTACAATTCA 

12 51" AGATGAGATT TGGGTGGGGA TACCGCCAAA CCATATGAAG TTCTTTCTTT 

13 01 GTTACTGGGT ACCATATCCA TTCTGTTGAG GTTCTGAGCC TTTCCAGTTA 

13 51 CTGTAACTCC TCTATCTCCT GTCTGTGCTA AGACTCAGTG ACCTCTCTCT 
1401 GCCTTGCTTC TGCTTTGTCC TGACCCTTTC TGTGCATGCA CTCACTCTAG 

14 51 TTTGCCCACC TGAGGTGAGA GATGGTCCAG ATTAGCAACA ACAATCTGTG 
1501 GACTAAAATC CTCTTTAGGG AGGAAGCAAA ATTCAGATGG ATGTTACTAA 
1551 ACAAAGCTCA GAAACAGAGA CCAGGGTGTG GGAAGTAAGG TAGTAGCCTG 
1601 AGAGCAGCTG GC AGTGTTTT AGACCTGGAG GGAGGTTAGG TCATCAGCAA 
1651 TGAGGAGACT GCCTGGAAAA TCCTAGAAAA TTAAGACATC TGGTCAGGCA 
17 01 AGGTCATATC ACCAGCACAC TTCCCTTTTC AAGTTGAATC CCTTTCCTCT 
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1751 


GTTAAGAGGA 


TTCAAGTGTC 


TTTCTTGCAT 


TTTGTCTTCT 


CTTCTATATC 


1801 


CATGCTTGCA 


ATATAAGGAG 


ACAGCAGTTG 


GCTGTTTGTG 


CTAGAAAATA 


1851 


TAAATGGCCA 


TTTTGAAAGC 


ATGCCAGACA 


GGATCTGCGG 


CAAGTTTTCA 


1901 


ATGTTACTGC 


TGCCATCTGT 


TGTTCTTCAG 


TGCTGGGATG 


TGAATCTCTT 


1951 


GGCAAACATC 


TCTCTAATTC 


TGAACTATCT 


TTCACCCCCA 


TCTAGAGATA 


2 001 


TTCACTTACT 


GAAGTGCCTT 


TTTAAAGCAA 


TGTTCCTCAC 


CAAGGCGATG 


2051 


TTCTGAATGT 


TTTAAAATGG 


AAGAATCTGG 


AATGTTTTTA 


TTATAATACA 


2101 


TTTTGTATAT 


CCCAAAGCAA 


AAATCAATTT 


CTTCATGGTT 


AATACTTTTG 


2151 


TAATTTTGTT 


TTTAATAATA 




AAATATAAGA 


AATATTTTAT 


2201 


TGAATTAATA 


CTTTAATGTA 


GCTGTTTCAA 


GTAAGATAAA 


ACAGAACAGA 


2251 


TTACTGTTTT 


CAACCTTGTT 


CACAGTTAGC 


TCTGTAACTA 


AGTTGTTGAG 


2301 


CTTTATCTAA 


GCTTTTTTAT 


TTTTACATAA 


CGTTTCCCTT 


TTCACTTAAC 


2351 


CTTGAAATTA 


TAGTAATTTG 


GGAACTTCTA 


TTCCTCTGAA 


AG AG AAAG C T 


2401 


AATGCCAAAG 


ATATTTCAAG 


GGAGAAAGAA 


GG TTTTTAAA 


AGGAGAGACA 


2451 


ATTCAGCTCA 


GACTTAATAG 


CTGTGATTGC 


TATTTATTAA 


GCAGAACGCC 


2501 


TATAACTAAA 


TTCTCAGATA 


TCCAAAAAAC 


AGCCTGTACA 


TTCTCAAAAG 


2551 


TGAAGATTAC 


ACATTTTCTA 


AGTTAAGGTA 


AAAGTTTTGT 


CTCTGTAGCA 


2601 


TCTTACTGAT 


TTCTATCTTC 


TCATTCTGCC 


TTAATAATGT 


CACTAAATAA 


2651 


ATGTTTGATG 


CACTAATACA 


TGAATAAAAC 


TATTCATGGT 


AATGATTCTT 


2701 


TAGAAACACA 


GCTAAGTTTT 


GTAATTTTGT 


TTTTTAAAAA 


TTAAAAATTT 


2751 


AAATATAAAA 


ATGTTTTTAA 


AAGGCTTGAA 


TTTCTTG TTA 


AATGTACACA 


2801 


TTTTAAGTTG 


TAGGCTGTCT 


TTAAAAATAA 


TCTCTCCACA 


CACTGTAGTA 


2851 


TTTAAAACAT 


CATGATATTA 


C TAT AAAAC A 


TCAACAAATA 


GGGCAGTGGA 


2901 


AAAC ATGGTA 


ATCACTAAAA 


ATGCTCACAT 


GTCATATATT 


AAGACTTGAT 


2951 


AAGTAAACCA 


CAATAATAAA 


TAGAAAAGAA 


ATAGTTGTCT 


AAAAAGGGAT 


3001 


TCTCACCTTT 


CAAACCTTAC 


CATAAAAATG 


GAATATAAAA 


GAAGGAAGAG 


3051 


GAGGAGAAAT 


CAAATTATAT 


CATAAAATTT 


TCTGGGCAAA 


AATATTACAG 


3101 


AAGAAAATAA 


GAAAGATTTA 


TGGAGTTGAC 


TGAAACATTT 


TTGAATCCTA 


•J 1 CI 


TACATAAAAA 


TATCGTTAAT 


TAAAAGGAAA 


AACAAAGAAA 


CAGATTTGGG 


3201 


AAATATTTGA 


A AfTYSf^'P'P'P'P 


X J. X X X£\\3\~£\ 1 




AA 1 AC AAA I Lj 


3251 


GATTATTTAA 


ACTCCATTGC 


AAAAATACAC 


AAAGGACATT 


GACAATGTCT 


3301 


GGAAATAAAA 


TTAGCTAAGT 


AAGTTATAGA 


AAAACTCAGT 


CTCACAATTT 


3351 


GACAAATGTA 


ACTGAAAACT 


ATTAATATAA 


TTAGTAACTA 


TTTTTACATG 


3401 


TCAAAATTTT 


TGAATTACTA 


AAGGAAACCA 


CAATGCCTGA 


AAGTATCCAG 


3451 


GGTTTTTTTT 


TTTTTTTATA 


ATATTGGCAC 


TGTCATATGG 


GTGGCAGGAA 
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3501 


TTGAAGTGAT 


GTTGTTTCTT 


C AG TTATTAA 


GTTGCATCTG 


CAGTGTTTCA 


3551 


AATGTCCAAA 


ACCTGTGAGT 


CAGTAATTCT 


CTTTTTGTAT 


ATTTATCCTA 


3601 


ATACAATAAT 


TCTAAACATA 


ATCTCAATAT 


ATATGTACAA 


AGTTATTCAC 


3651 


TGCAGTGTTA 


CTTACAATAG 


TTAGAAAATT 


GTAAAATGCT 


TTATGCATCT 


3701 


TAAAATATAA 


ATTGTTGAAT 


ATATAATAGT 


CCATATGATA 


TAATTATATC 


37.51 


ATTATTATAA 


ATAATGAATT 


AGAAAATAAT 


TTAAGAGCAT 


TAAAATAATT 


3801 


ATAAGGTAAT 


ATGAAGTGAA 


TGAATAATGT 


ACAGATACTA 


TAATCAGCAG 


3851 


AGTGTTAACT 


AGGTAAATTT 


TTATGTGTGT 


ATATACTACT 


TCCTAAAAAT 


3901 


GACTTGACAG 


AAATCATCAA 


AATGCTAATG 


GTGGTTACTT 


CTGGGTGGGA 


3951 


ATACAGATGA 


TTTACTTTGT 


TCCTTTTATG 


TATTTCTGCA 


CTGCCCAGTC 


4001 


TTCCACAGTG 


AGCATATATT 


GGTTTTTAAA 


TTTATATAAG 


ATGGAAAAAG 


4051 


ATACCAAATG 


GTCTTCAATG 


AATCCTGGAG 


TTAACTTTCA 


TGTGTGTCAT 


4101 


ATGTTATATT 


CTAAACTTAT 


CACAAATAGA 


AGACTTTAAA 


TCAACTTGTA 


4151 


CCTATTTCAA 


CTATATAACA 


GCATCTTTAA 


AATGAGCATT 


GAATTAAACT 


4201 


ACCAAAAPCA 


ACCATCATGA 


GGATTATTCA 


AGTAATGTGT 


TTAAACAAAA 


4251 


GAATTTGTAA 


TAAAATTACT 


TTATCTCCTT 


TGTGATTTCA 


GCCCATTTAA 


4301 


AAAAAATAGA 


TGTTTCTACT 


CTCCTTCAGA 


TATCATTAAA 


ACATAAACTT 


4351 


GTGCCTGACT 


GCATAAATCC 


CTTTTAAACT 


AATATCACTT 


ATTACGTTTA 


4401 


ACTAAGTCTA 


CCTAGGGCTT 


CCTTGTATAA 


AGAACAAGAG 


CTTTCCATTT 


4451 


TTTGTTTACC 


TAGCCCTTTC 


TGATGCCACG 


ACAGAATAGC 


TGTAAATCTT 


4501 


CATTATTTAT 


ATTCTAGAGA 


AAATAAAAGC 


AAATAAAAAG 


GTCAGTGTAT 


4551 


AAAGTTTATT 


GGTTGTTCTC 


TTTACTCAAA 


ACCCACATGG 


TATTAATGTT 


4601 


AGTCTCTATG 


AATATTTCAT 


GGATAAAATC 


AGAGCATTAA 


GTGCATACTA 


4651 


AAAACAATAA 


GAATGGAAAG 


ACTTTAACCT 


TATGTTTATA 


TGAATTTCTA 


4701 


GGTTATCAAG 


AAGTTTATAG 


GCTATAGGCT 


ATAAAGTCTT 


AGGCTATGAT 


4751 


ATAGTAACCT 


AATGTAGACT 


TCCCTTGATA 


CATGAAAATA 


ATGGTACTAA 


4801 ' 


GTACAAACAG 


AAGATGAGCT 


TAAAATTATT 


CTTTGAGTCC 


TCTTGATGGA 


4851 


TTTTTTCCCC 


CACACTTTCC 


CCAAAATTGT 


TTTATGCCTA 


TATTGTAGGA 


4901 


GACCATGCAA 


GAGACCTAGA 


GTCTCTTTTT 


CTTTCATCAC 


TTTCCAATCA 


4951 


ACAGCAAATC 


CTATCATTTT 


TACCACAAAA 


TATATCTTGA 


AACTCCCTTC 


5001 


TTTTGATTTA 


CTTGTAACTC 


CCCATCAAAA 


ACTGAAGAGT 


GTCACAATAC 


5051 


TTCATTAAGT 


TCCCTACTTG 


CACTCTACCT 


TTAATATATT 


TGTAGCACTA 


5101 


AAATGTTTTT 


AAAACATATA 


TCTGCTTATG 


TCATTTTACT 


GCTCAATACT 


5151 


ATCTGATTTT 


CTATTGCACT 


TCTAAGATAC 


TCTAATTTCT 


TAGCACTCTA 


5201 


TATAAAATCC 


TTTAAGGGCT 


TCCCTGCTCA 


CCTTTTCAGA 


CTCAGAACTA 


5251 


TGTATTTCCT 


TTTGCCTGCT 


GTACTTGTAC 


CACTGGATTC 


TTGATTTTTG 
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5301 


TTACTTCCAG 


GTTTTTACAC 


TTATTTTTAC 


AATAAATGTG 


AAATACCCTT 


5351 


TTTGACAATA 


TCTACAAATA 


TTTCTTATTT 


GTCTTTATTG 


CTCTTTCCTG 


5401 


TAATGTTTAG 


TCTTCATTTT 


CCTGATAATG 


GCTATCTAAA 


GTTATCTCCT 


5451 


CAAAGAAGCA 


GTTATTTATT 


CACCCAAATC 


TTCTAGTCCT 


TCTCTGGAGT 


5501 


TTTCTTCTCA 


CTTCATTCCC 


TTGG TTTTTG 


CCACAATTTG 


TAATAATTTG 


5551 


CAATTTGGAG 


TGTTAGAATG 


AGGGAATAAA 


TCACAGGTAA 


TGACTATAGT 


5601 


TTGTGACTAT 


GTAAGATTGG 


ATTCGTTATT 


GATTTATTCC 


ACAAACACTG 


5651 


AGGCACTGCA 


TTTAGCCAAA 


TGCCAATCTT 


GGGCAGTGAG 


ACTCTGAAAG 


5701 


AGAATCTGCT 


TCCCCCACCA 


TAAACTACAA 


AGTGAAACAA 


CTCAGAATGT 


5751 


ACATAAATTA 


CAGAATGAAA 


GCACACTAGA 


AGTAAACACA 


GATGTGGAAG 


5801 


AGGTAAAGTG 


TCCTTGAAAA 


TCATGGAAAG 


ATTCATAAAG 


GGAATGACAT 


5851 


TTCAACTGGA 


TTCTAAACCA 


GTTATTCAAG 


CTCCACAAGG 


TTGCACAGTA 


5901 


AATGAGCAGT 


GGCAGGATGA 


CATACCTTAG 


AAAGTAAAAG 


GAATCTTTTT 


5951 


TAAACTGCTA 


TAAAAATCAT 


TACATATACA 


TTTTG TAGGT 


CGAGAGTAAG 


6001 


G TATTTAAC A 


TAAAATCATT 


TTAGTATATC 


AGTGTTTATA 


TAGACTTAGG 


6051 


TTTTTCTCAT 


TTAAAACCTC 


TTTTAATGAC 


TTGTGCTTTT 


CTTCATGGTA 


6101 


ATAAAACATT 


TTCCCAGGAA 


GTGCTG AATA 


AATCTTTCTT 


GAAATACGTT 


6151 


TTATTGCTTT 


CTATCAATGA 


CCCTGAAGTA 


ATACAGAATT 


T AC AC TTC AG 


6201 


CGGTTGCAAT 


GCTCAAACTT 


GACAGGTAAT 


GCACTGTGTT 


TGCTGATATA 


6251 


AGAGGTATGA 


TGTAGGGCTA 


AGTGG TTTTG 


TGCTCATTTA 


GCTTTCAGGA 


6301 


GAAAATAATT 


GACTTAACAT 


TTTGATACTA 


AAACCCAAAG 


CCTAACAGTT 


6351 


AATTCTTGGT 


ATTTTAAATT 


ATTATTGCAA 


AGATTATTGT 


GCCGAATAAT 


6401 


ATGAAAATAT 


TTTATATAAT 


ATTTAAAAAG 


TATATCTCTT 


TCTTGGTATT 


6451 


ATTTAAATTA 


CCATAAAAAT 


GTGCGAAAAA 


GTTATACTGA 


AATGTGATAG 


6501 


GATCTTTTAA 


AAGTGGTGCC 


TTGATTTTGT 


TAAGTGTTAC 


CTAGTTTTCC 


6551 


TCTGAAAACA 


AGAAACATAC 


CCAGAAGTTT 


TCACGAAATG 


GTCTCATGAA 


6601 


TATCTAAGGT 


TAGTCCGTAG 


TCTCATCTGA 


GACAAGGAAA 


GTCCCTTCCA 


6651 


CTATGAGCCT 


GTAAAATCAC 


AAGCAAGCTA 


GTTACTTCCT 


AGATACAATG 


6701 


GGAGTACTGG 


TATTGGGTAA 


ACACAGCTGT 


TTCAAATGGG 


AGAAATTGGC 


6751 


CAAAATTAAT 


GGGTTACAGG 


GCATGCAATT 


CCGAAATCCA 


TCTGGGCAGT 


6801 


CAAATTGTAA 


AACTCCAAAA 


TGATXTCTTT 


TGACTCCATG 


TXTCACATCC 


6851 


AGGACATGCT 


GAXGCAAGAG 


ATAGGTTCCC 


ATAATCTTTG 


GCAGCTCTGC 


6901 


CCCTGTGGCT 


TTGCAGGGTA 


TATCACCCCT 


CCCAGCTGCT 


TTCACAGGCT 


6951 


GGCATTGAGT 


GTCTGTGGCT 


TTCCCAGGAA 


CAAGGTGCAA 


GCTGTTGGTG 


7001 


GATCTACCAT 


TCTGGGGTTT 


GGAGGATGAT 


GGCCCTCTTC 


TCATAGCTCC 
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7051 


ACTAGGCCGT 


GCTCCAGTAG 


AGACTCTGTG 


GGGGCTCTGA 


CCCCAGATTT 


7101 


CCCTCCTGCA 


CTGCCCTAGC 


AGAGATTCTT 


CATGAGGGCC 


GTGCCCCTGC 


7151 


AGAAAACTCT 


TTCCTGGGCA 


TCCAGGCATT 


TCCATACATC 


TGAAATCTAG 


7201 


GTGGAGGTTC 


CCAAACCTCG 


ATTCTTAATT 


TCTGTGCACC 


TGCAGGCTCT 


7251 


CTACCACGTG 


GAAGCTGCCA 


AGGTTTGGGG 


CTTGCACCCT 


CTGAAACCAC 


73.01 


AGGCTGAGCT 


ATACCTTGGC 


CCCTTTTAGC 


AATGGCTGGA 


GTGACTGGGA 


7351 


CACAGGGCAC 


CAAGTCTCTA 


GGCTGCACAC 


AGTATGGGCA 


CCCTGGGCCC 


7401 


AGCCCTCAAA 


ATCATTTTTT 


CCTCCTAGGC 


TTCTGGATCA 


GTGAAGGGTG 


7451 


GGGCTGCCAT 


GAAGACCTAT 


GACATGCCCT 


GGAGACATTT 


TCCCCATTGT 


7501 


CTTGGGGATT 


AACACTGGCT 


CCTTGTTACT 


TATGCAGATT 


TCTGCAGCCA 


7551 


GCTGAATTTC 


TCCTCAAAAA 


ATGGGTTTTT 


CTTTTCTACT 


GCATTGTCAG 


7601 


GCTGCAAATT 


TTCTGAACTT 


TTATGCTGTT 


TCCCTTTTAA 


AATGCGATGC 


7651 


TCTAACAACA 


CCCGTCACCT 


CTTGAATGCT 


TTGCTGCTTA 


GAAATTTCTT 


7701 


CTGTCAGATA 


CCCTAAATCA 


TCTCTCTCAA 


GTTCAGAGTT 


CCACAAATCT 


7751 


CTAGGGCAGG 


GGCAAAATGC 


CACCAGTCTC 


TTTGCTAAAA 


CATAACAAGA 


7801 


GTCGCCTTTG 


CTCCAGTTCT 


CAGCAAGTTC 


CTCATCTCCA 


TCCGAGACAA 


7851 


CCTCAGCCTG 


GTCCTTATTG 


TTTATATCAC 


TATAAAAATT 


TTTGTCAAAG 


7901 


CCATTCAACA 


AGTCTCTACT 


CCAAACTTTC 


CCACATTTTC 


CTGTCTTCTT 


7951 


CTGAGCCCTC 


C AAATTG TTC 


CAGCCTCTGC 


CTGATACACA 


GTCCCAAAGT 


8001 


TACTTCCACA 


TTTTTGGATA 


TCTTTTCAGC 


AATGCCCCGC 


TCTACTGGTA 


8051 


CCAACTTACT 


TTGTTAGTCC 


G TTTTC AC AC 


TGTTGATAAA 


GACATACCCA 


8101 


AGACTGGAAA 


GAAAAAAAGG 


TTTAATTGGA 


CTTACAGTTC 


CACATGGCTA 


8151 


GGGAGGCTTC 


ACAATCATGG 


CAGGAGGCAA 


AAGGCATTTC 


-TTACATGATG 


8201 


GCAGCAAGAG 


AAAATGAGGA 


AGATGCAAAC 


GCAGAAATCC 


CTGATAAAAC 


8251 


CATCGGACCT 


TGTAAGACTT 


ATTCACTACC 


ACTAGGACAG 


TATGGGTGAT 


8301 


ACCACCCCCA 


TGATTCAAAT 


GATCTCCAAC 


CAGGTGCCTC 


CCACAACACA 


8351 " 


TGGGAATTAT 


GGGAATACAA 


TTCAAGATGA 


GATTTGGGTA 


GGGACACAGA 


8401 


GCCAAACTAT 


ATCACATGGA 


TTTCTTATAC 


TTTTGCTTTT 


AATAACACAA 


8451 


ACAAAAAAAT 


ACATCATTAA 


AAGGTTAGAA 


GTGAGAAGGT 


GTTTTTATGG 


8501 


AAATCAAAAA 


TAATATCACC 


TTAGTGAACA 


GTATTCTTAT 


GATTGTAGTT 


8551 


GAATTAGAGA 


GCAGAATACA 


TCTAGAAGAT 


TCAGTAGTAA 


GCATGTTTCT 


8601 


TCGATTAATG 


GAAAATTTGA 


ATAGCCTAGC 


TGATTGAGAT 


TGAGGTTACT 


8651 


ATTAAATGCC 


TGAAGTATAA 


GAGTTGGTTG 


TTTATG TAAA 


CAAAATATCT 


8701 


GTTTTACATG 


TACATGTGTA 


AGTAGGACTG 


TTGAGCCCCA 


GTAACATGAA 


8751 


ATATCAAAGA 


GCATGACTCG 


AATACCTGCC 


ATATGAAGTG 


CTATTACATC 


8801 


AAAAAAGAGG 


CGTGTGCTGA 


AAAATTACCT 


ACAAATGGCA 


TTTTCCTCAA 
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8851 ATCAATTTTA AATCTTCAGA ATTTCATTTT AATAATTGTT TAGTTAATAT 
8901 TTCAGAATCC CTCATCATAA AAAGCAGGCA AAAGGCAAAA GTCCTTGAAT 
8951 G TATAAC AC A TTTGTTTTCA AACAAGCCTG CCTCTAACTG TGAATCCAGG 
9001 AGTGAATCCA GAACTACAAA TTAACTAAGA TTGGCCCCAT CGAGTTACTG 
9051 AACGTTAAAA ATCTAAAAAC TAAAAGGCAT GCCTCAACAA TTATTTTCTT 
9101 CTTGGAATCA TTAATTAACC TATGTGTATC CAAACAATAA TCTTCCAGCA 
9151 GTTTCGCTAG CTACATTTTT AATTACTTAA TATCATGTAA AATTTG TTTT 
92 01 ATTATTGTTC AGTTCTGAAT TTTGACATAT GCATCAAGCC ATGCAACTGC 

92 51 TACCACAGTC TTCCTGATCA CTGATCTGTT CTAAATCTCT ATAGCATTTT 

93 01 TCCTTTTCTT AAATGTTGCA TAAATAAAAC CATACCTTAT GTGGCCTTTT 
9351 GAATCTGGCA TCTTTAACTT AATGCGCTTG AAATTAATCT ATGTCATTTC 
9401 ATGTATCAAT GGCTCAATCT TTTTAATTGT TAAGAAAAAA TGTATGCTGG 
9451 GATAAATATC TTTCTAAATG AGTTTTTGTT CACAATGCTG AGTGTTTGTT 
95 01 TAGGATAGAG TCCTAGAAAT GGTATCACTA GGTCAAACAT TCAAATAATT 
9551 TTAAAATATT TGATACATAT TGCCAAATAA TCTCAAATTT TTTACCAATA 
9601 TACATTTATG ACAGTATGGG ATAAATGTGT CTTTCTTATA CCAACTGACA 
9 651 ACATTAATGA TAATACATAA AATATTCTTT GCTAATTTGA TGGGACAGAA 
97 01 ATGTTATATC CTTATTAGCA TTTTATTATT GTGGTTGAAT GACTGTACTG 
9751 TACAGCCAGA GATATTTGGT TCAAAATCCA TCTTCATTAT TTACTGTATG 
9801 TGAAAATTTA GGTGAGCTAT TTAATCTCTT GATGCCTTAG TCTCCTAATC 
9851 TATAAAGTGG GGATAATTGT ACCAATCATA TTAGGTTCCT GTGAGAATTA 
9901 ACTGAATTAC TATAGAAAAT GCTTAGAATG GTATCTAGTC ACCAGGAAGG 
99 51 ACTCTCTCTG TATTACTTGT TTATTATCTA ACACGTTTAA TTATTAATGA 

10001 AGCTCAGTTT CGTTATATGC TTGGGATATT TGAAACTTTT CTTAGTGAAT 
10051 TTTCCAATAA AATTATTTGT CTATTTTTCT ATGGACAAGT TGGTATTATT 
10101 CTTACTGGTT TGTTTCAGGT TCAGTTAGTA AGAATTTTAA GGATTTTCTA 
10151 TCACATTTTA GCAAACTTTT TCTGCATTTT ATCTTTTTTC TTTCAGATAA 

102 01 TG TTTGC AAA ATGTAAAAAA AACAAAAGGT TTCTTCATCA AGTTGGTATC 
10251 TTTATCTTTT TTATTGCTTT GTGATTTGAA AATTCTTGTC CTGAGAACCA 
10301 AAATATATAT TTGATGAAAT AGTTCTCTTC TTTTACTCAT TCTGAAGTCA 

103 51 TTGGAATTGA ATTTGGCATA TGATATAAAT CCTAATTTTA TATTTTATGA 

104 01 TATTCAAAAT TTCTAACAAA TATTTACTTA ATAATCTAAT CCAGGTTTCT 
10451 ATTGTTTCTT CTGTTTCCTT TATAATGCTT TTTCTGAAGT TATTTTTCCT 

105 01 AGACTTAAAT ATTAGTATAA TATTATCATA GAGGAAAAAA TATCTGTTAG 
10551 CTATGAATAA AAGGCTTTCA TCTTATTGTT GCATTAATAT ATTTAAATGT 

Fig. 2 (cont'd 5) 
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10601 AGAGAGCATA CAGATTAGCA AAGAAAAAGT ATAATTGCCT TTTTTTATAG 
10651 TTGACATGAA CATGTATAAA GAAAAACCAA AAAAATCAAT AAAACAACTA 
10701 GAACTTATTA GTGAATTTAG CAAGATCATA GCATACAAAG CCAAGATTCA 
10751 AAATTCCATT TTATTTATCT ACTAACAAAA AATATTTGAA ATTTGAAAAT 
10801 TTAAATATGC CATTTACAAT AACATCAAAA TATTGAACAA TAAAGTATTT 
10851 AGGAATTTAT AAAATGAAAT CTCCTATACC AGGAATTACA GACCATTGCT 
10901 GAAATAAATG AAAGAAGACC AATATATGTG AAGAGATACT CATTTGTGGA 
10951 TTGAGAGACA ATATTGTTAA AGTATCAGTA TTTCCCAAAT TAATCAATAG 
11001 ATTCAATATA ATGGTGAACA GAACACCAGA AGATGTTCTG TCGAAGCTGA 
11051 CAAGCTATTT CTATAATTCA AATGGAAATG CAAAAGGCAG TCACTGCCAA 
11101 CACCAGCATG GACTGTCTGG GTTCCAGTAG GTTACTTCAC TACTGCCTCT 
11151 TCTGTCAGCC ACATCACGAC AGCTGCCCAG AAGCCAGAGA AACTCCTCAC 
112 01 ACCTGGCCCA CTGCTGCAGC TACCAGCATC CAGGCAAGCC ACCATCAGCC 

112 51 CACTGGTAAC TGCCAACAGA GGTACCACTG TACACTACCC TGGGGAACAA 

113 01 AGATAGGCAT GTAGTCAGCC CACCTCTGCC ACCACTAGGG CCTGAAGCCT 
113 51 GGCCCACCTG ACACTGCAGT CCTCAGCACA GCTTCATCAC AGCTTCTGTT 
11401 AATAACCACA CCCTAACCTA CCAAGGAAAT CACAAATGTC AC TG AC AC TG 
11451 TTTGTAGCCA AAGAAATCAT AGAGAGACTA CATTACTGCA CACACCCATA 
11501 ATCAAAGCCA CAGTACCCTA TCCAGACAAC ATCACAGGTA TATCTAAAGG 
11551 AAAAAATTTT C C C ATATGAA AGCGAATTCA AATATAGGAA GAAGCGACTG 
11601 TTACAACAGA TATGCAGATA AAGCTTCAAC AATATCCTAC ATTCAACCAG 
11651 AAGAAAGAAT CTCAGAAGGT AAAGACAGGT CTTCTGAAAT AATCTAGTCA 

117 01 GACAAAATTA AAAGAGAATA ATCAAATCCT TC C TG AC ATT TGGGATAACA 
11751 TTAAAGTGAC CAAATATACG AATTATAGAT ACCCCTGAGA GTGAAAAGAC 

118 01 AAAGAAAAGA TTAGAAAACC CACTTAATTA AATAATATAT GAAAACTTCC 
118 51 TAAGTCTAGC AAGAGTTTTA GATATTTGGG ATGCAGGAGG CTCAATGGTC 
11901 -CCCAGGCCGA TAAAACGCAA AAAGGTCTTA TACACAGCAC ATTACAATCA 
11951 GACTGTTTAA AGTCAAAGAT AAGGAATAAA TTCTAAAAAC AGCAAGAGAA 
12001 AGTGTATGAT AACCTATGAA GTAAACCTTA TCAGACTGAC AGCAAATTTC 
12 051 TGGCAGAAAC TTTACAGGCC AGAAAGAATA GGACAATATA TTCAAAGTGC 
12101 TTAAAGAAAA AAAAAACTAT CAGCCTTAAA TACTATAGCC CACAAAATTA 
12151 TCCTTCATAA ATGAAGGAGA AATAAAAGGT TTCCCAGACA CGAAAATGCT 
122 01 GAGGTAGTTT GTT AC TACT A GACTGGACCT ACAATAAATG CTCAAGGGAG 
122 51 GTCTGGAAAC TGGTAGTGAA AGGACGACAT TTATCATCAT GAAAATACAT 
12301 GA^AGTATAA AACTCCCTGG TAAGCAACTA AAGGGAGGTA TCAAATGTTA 
12351 CCACCAGAGA AATCTAACTA ACCACAATGA CAAACAATAA GGGAAAAAGA 

Fig. 2 (cont'd 6) 
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124 01 AAGGAACAAA AATATATAAG ACAACAAATA AACAACAATA TAACAGGAAG 
12451 CCTCACATAT CAGTAATCAC TTTGAATGTA AATGAATTAC ATTCTCCACC 
12501 TAAACGTTAT GAAATGCCTG AATGATAAAA CTATATGATC CAAATATATG 
12551 CTGATTACAA GAAACTTACC AGGCAGACAT ACATAGGCTG AAAGTAAAAG 
12601 AATGGTAAAA GATATTCCTT GCAAATGGAA AGCAATAGTG AGCAGGAGTA 
12 651 GCTATACTTA AATTAGATCA TACAGACTTT AAGTCAAAAA GAGTAAAATA 
12701 AAAAAGACAA AGGATGTTAT TATATAATGA TGAGATTAAC CCAGCAATGG 
12751 GAAATAACAA CTCTAAATGT ATATGCATTC AACACTAGAG AACTCAGATC 

12 801 CACAAAGCAA ATATTAGACC TAAAGAGAGA AATAGACTGC AATACAGTAA 
12851 TAGTGGAGAA CTTCAACACT CCACTTTCAG TATTAGACAG ATAATCTAGG 
129 01 CAAAAAATCA ACCAGTAAAT TTTAGATTTA AACTAGATTT TAGACCAAAT 
12951 GGACCTAACA GACATTTACA AAACATTCCA TCCAACCACT GCAAAATGAA 

13 001 ATTTGTGTCA TCAGCACATG AAACAATGTC CAAGATAGAC CACCATATGT 
13 051 TAGGCCACAA ATCATGTCTC AGCAATTTTT TAAAAGTTGA AATCATATCA 
13101 CATATCTTCT C AG ACC AC TG TTGAATAATG CTAGAAATCA ATGCCAAGAA 
13151 TAACGTTGGA AACTATACAA ATACATGCAG ATTAAACAAC ATGTTCCTGG 
13 2 01 TTGATCACTG GGACAATAAG GAAATTAAGC TGAAAATCAA AAAATTCTTG 
13 2 51 TAACAAATAA AGATTGAAAC ATAACATATC A^AACCAGTG GCATACAGCA 
133 01 AAAGCAGTGC TAAGAGGGAA GTTTATAGCA ATAAATGCTT ACACTGAAAA 
13351 AGTAGAAATA TTTTAAAATT AGCAACCTAA CAATGTGCCT GAAGAAACTA 
13 401 AAAAATCAAG AACAAATCAA ACCCAAAATC AGCAGAAGAA ACACAAAAAT 
13451 AAAGATCAGA AAAGAACTAA ATCAAATAGA GACTAAAAAA ATACAAATGA 
13 501 TTAACAAAAC TAAAATTTGG TTATTCAACA AGATAAATAA AATTGATAAA 
13551 CCGCTAGATA GACTAAACAA GGAAAAAGAA TATCCAAATA AACACAATCA 
13 6 01 AAAACGATAA AGGAGACATT ACAACAGATG CCACAGAAAT AAAAAGGATC 
13 651 ATCAGAGACT ATTATTAACA ACTATATGCT GAAAAATGGA AAATATAGAG 
13 7 01 AAATAGATAA ATTCCTAGAA ACTTACAACC TACCAAGCTG TTGCATCAGG 
13 7 51 AAGAAATAGA AAACCTGAAC ATATCAGTAA TGATTAGCAA AATTGAATCA 
13 801 GTAATAAAAA ACATCTCCCA ACTCTTTTAA AGCTTTGGAC CAAATAGCAT 
13 851 CACAGCCTAA TTCTACCAAT CATGCAAAGA AGAATACCAG TCTTCTTGAT 
13 901 GCTATTACAA TAAATCAGAG GAAGGAATTC TCTCTGGCTC ATTCTACATG 

13 9 51 ACCAGTGTCA CCTTGAAACC AAAACCTGAC AAGGACACCA CAAAAAGAAA 

14 001 ACTACAGGCC AATAAC C ATG ATGAACACAG ATGCAAAAAT CATTAACAAA 
14 051 ATACTGGCAA ACGGAATCCA ACAGCACATC AAAAAAATAA TATACCACAA 
14101 TCCAGAGGGT TTGTATCAAG GATACAAGTA TGACTCAATG TAAATAAATC 

Fig ' 2 (cont'd 7) 




13/ 124 

14151 AATAAACATG ATAAGCATCT TCACAGAATA TAAGACAAAT GAATATATGA 
14201 TCATCTCAAT AGATGCAGAA AAAAATTTTT GATAAATTTC AACATCTCTT 
14251 CATGAAAAAA ATCTCTAAAA CTCAGCATAG AAGAAACATA CCTCAATATA 
143 01 ATAAAGGCCA TATGTGACAA ACTCAGAGCT AATATCATAC AGAATGGGGC 
14351 AAAGTTTAAA GACTTTCCTC TAAGAACTGG AACAAGACAA GGATGCAAAC 
14401 TCTCACCACT CCTATCCACA TAGTACTAGA AG TCC TAG CC AAAACAATCA 
14451 GACAAGCAAA AGAAATAAAA AGTATCTAAA TTGAGAAGAG CAAGTAACAT 
14501 TGTTCCTCTT TGCTGATGAT ATGGTTTTGT ATCTGGAAAA TACTAAAAAC 
14551 TCCAGCAAAA ACCTCTTAGA TTTGATTAAT TAATTTAGTA AAGTTTCAGG 
14601 ATACAAAATA AAAATACAAA AGTCAGTAGC ATTTCTATGC CCCAATAATA 
.14651 AAATAGCTAG GAAAGAAATC AAGAAAGTGA TCCCATTTAA ATTAGCTACA 
14701 AAAAATTAAA ATACCTGGGA ATAAATCAAG GAAGTTAAAG ATCTCTGCAC 
14751 AAAACTACAA AACACTGATG AAAGAAATTA AGGATTAAAC AAACAAATTG 
14801 AGAAACATCC CATGTTTATG GATCAAAAGA ATTAATATCA TTAAAATGAC 
14851 CATACTTCCC AAAGCAATTT CCACATTCAA TGCAATTTCT ACCAAATTAC 
14901 CAATGTCATA TTTCATAGAA TTAGAATAAT CCTAAAATTA GTATGGAATG 
14951 AGAACAGAGC CCAAATAGCC AAAGCAATTC TGAACATAAA GAACAAATCT 
15001 GGTCCTGACT TAATCACTAT GCAATCTATG CATGTAACAA AATTGAACAT 
15051 GGATTTTATC AATTTGTACA AATAAAAAAA TGTAAAAAAA GAACAAAGCT 
15101 GGAGGCTATA GTAGCCAAAA CAGCATGGTA TTTTTAGACA AATGGAATGG 
15151 AATAGAAAGC TCAGAAATAA AGCCATATAT ATATATTGTG TGTGTGTGTG 
15201 TGTGTATACA CACATACATG TATATATAAT GTGTACATAT AATGTTTTCT 
15251 ACATGTTCTA ATATTTATAT TCCATTCCAT TATACATATT CCATTTCTGT 
15301 ATATAGGTTA TATAGAATTG GAAGACTATC TGCCATTAAA AAGAATGAAA 
15351 TCCTGTGATT TGCAGCAACA TGGTTGAAAC TGGAGTTCAT TATCTTAAGT 
15401 GAAATAATCT AGGCACAAAA AGATAAATAT CACATGTTCT CACTTATATG 
15451 - TGGGAGCTAA TAACTTGATT ACATGAAGGT GGAGAATGGA AAGGTAGGTA 
15501 GGAAACAGAG ACTGGAAAGG ATGAATGGAG GGTAGGAGGG AAGGTGAAGA 
15551 GAAGAGAGTT AAAAGGTGTA AACATATAGT TAAAAGAAAT AAATTCAATG 
15601 CTTGATAGCA GAGTACAGTG ACTACAGTTA ACAAAATGTA TTATACTCAG 
15651 GTGATGAACA CCTAAATACT TGATCACTAT GCAATTATAT ACGTGTAACA 
15701 AAATCACTAT GCACTATATA CGTGTAAAAT TAAATGCGTA CAAATAAAAA 
15751 TAATAAAATA CTAATCCAGT ATCATTCACT GACAATGTTA ACTCAGGTGG 
15801 ATAGGCATTA AGTCAATACT ACTATAAGAA CCACTTCTTG TTTATGTTAA 
15851 TGCCATATAG AATGAAATAA AATTCACTAA AATCCAAAAA ATTAGAAAAA 
15901 CTATCAAAAC TCAATAATAT TAAGACAACC CAATAAAAAT GTGGTCAAAG 

Fig. 2 (cont'd 8) 
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15951 


GATTTGAACA 


TACATGTCAC 


CAAAAAATAT 


ATTCAAATTT 


CCAATAAATA 


16001 


CATGTAACAA 


TGTTCGACAT 


CGTTAGTCAT 


CAGAGAAATA 


CAAAATAAAA 


16051 


TGGTAATGAG 


ATACTACTAG 


ATAGGCTTTT 


ACAGAGACTG 


ACAATACCAA 


16101 


GTATTGACAA 


GGATATGGAG 


CAACTGAAAT 


TCTCATTCCT 


TGTGGTAAGA 


16151 


ATGTACAATT 


ATATAACCAC 


ATTGAAAAAA 


CAAGTTTTCA 


GTTTCTTTAT 


16201 


TCACCCAAAA 


TATATGTCTT 


TTGGAAAAAA 


TTTTTTCCAG 


TCTGTGGGTT 


16251 


GTCTTCTCAT 


TCTCTTGATA 


TATGTCTTTT 


CAAAGAGGCT 


GAGCTTTACT 


16301 


TTAGACAGTG 


GTCATCAAAG 


TGTGTATATT 


TGTGTTTTTA 


TAATTTATAT 


16351 


GCATATATTC 


CTGTGAAAAG 


ATACTGTATG 


CATTGTTCAA 


CATGTACAAA 


16401 


TATAAGAAAG 


ATATAGTAAA 


GAAATATATA 


TTTCTAAATT 


TATAAATGTA 


16451 


TTTATTGGTG 


TTCCACGTTG 


CAAACTAAAT 


AATCTACGTT 


GGCTAATTTA 


16501 


AGGAATTAAA 


CTATAGTAGA 


AGGTTCTCAT 


TTATTGGGAT 


GATTAGAACC 


16551 


AGCCTTTTTG 


CAGGCTATTA 


GCGAATCATA 


GCACTAGGGC 


TTCACTGCTA 


16601 


CCTCCACTGA 


CACCTCTGAC 


ACTTGAAACT 


TGAGGCCAGA 


TATCTGCCCA 


16651 


TGCTGA'TAGA 


AAACAACTGA 


ATAATTTAAT 


TTGCTAGATA 


ATAGAAAAGA 


16701 


ATCAAATGAC 


TCTGCCACAT 


TGCTTGCCAG 


AAGATTGTTT 


TTCTCATTTG 


16751 


TGACCTCTTG 


CCTATAAATG 


AT AG AT AG TC 


CCTGTGCTGC 


ATGCTATAGG 


16801 


TGTTCGTAAG 


AGAGTCTGGG 


AATGTGAGCT 


TTTTATATCC 


TATTTTTGGG 


16851 


TGGTAAAGGT 


CATTCTATTA 


GTCTGTTCTT 


AAACTGCTAA 


TGAAGACATA 


16901 


CCCCAAATTG 


GGTACTTTAT 


GAAAGAAAGA 


GGTTTAATTG 


ACTCACAGTT 


16951 


CAACATGACT 


GGGGAGGCCT 


AAGGAAAGTT 


ATAATCATGG 


GGGAAGGGGA 


17001 


AGCACACATG 


TCCTTCACAT 


GGTAGCAGGA 


AGGATAATGA 


GTAAAAGGGG 


17051 


GAAAAGCCCC 


TTATAAAACT 


ATCAAATCCC 


ATGAGAACTC 


ACTCTCACAA 


17101 


GAACACAATT 


AGAGTAACTG 


CCCCCATGAC 


TCAATTACTT 


CCCACCAGGT 


17151 


CCCTCCCACA 


ACACATGGGG 


CTTATGGGAA 


CTACAATTCA 


AGATGAGATT 


17201 


TGGGTGGGGA 


CACAGCCACA 


CCATTTCATT 


CCACCTCTGA 


CCCCTCCCAA 


17251 


AT.CTCGTGTT 


CTCACAATTC 


AAATACAATC 


ATGCCCTTCC 


AACAGTCCCC 


17301 


CCAAAGTCTT 


AACACATTTC 


AGTATTAACA 


CAAAAGTCCA 


AGTCCAAAGT 


17351 


CTAATCTGAG 


ACAAGGCAAG 


TCCCTTCTGC 


CTATGAGCCT 


GTAAATTCGA 


17401 


AAGCAAGTTA 


GCTACTTCCT 


AGATACAATA 


GGGTCACAGT 


CATTGGGTAA 


17451 


ATACACACAT 


TCCAAACGGG 


AGGAATTGAC 


CAAAACCAAG 


GGGCTACAGG 


17501 


CCTCATGGAG 


GTCCAAAATC 


CAATAGGGCC 


ATTGTTAAAC 


CTTAAAGTTT 


17551 


CAAAATTATC 


TCCTTTGACT 


TCATATCTCA 


CGTCTAGGTC 


ATGATTATGC 


17601 


AAGAGGTGGG 


CTCCCACAGC 


TTTGGGCAGC 


TCTGCCTCTG 


TGGCTTTGCA 


17651 


GGGTACAGCC 


CCACTCCAGG 


CTGCTTTTAC 


AAGCTAGTGT 


TGAGTGCCTG 



Fig. 2 (cont'd 9) 
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17701 


CAGCTTTTCC 


AGGCACATGG 


GTGCAAGCTG 


TAGGTGGATC 


TACCATTCTG 


17751 


TGGTCTGGAG 


GATGGTGGCC 


TTCATCTCAC 


AGATCCACTA 


GGCAGTACCC 


17801 


CAGTGGGGAC 


TCTGTGTGGG 


GGCTCTGATC 


CCACATTTCC 


CTTCCACACT 


17851 


GCCCTAGCAG 


AGGTTCACCA 


TGAGGGCTCC 


ACCCCTGCAG 


CAAACTTCTG 


17901 


CCTGAACATC 


CAAGCATTTC 


CTTACATCCT 


CTGGAATCTA 


GGCGGAGGTT 


17951 


TCCAGACCTC 


AATTGTTGAC 


TTCTCTGCAA 


ATGTAGGCTC 


AACACCCCAT 


18001 


GGAAGCTGGC 


AAAGCTTGGG 


GCTTTCACCT 


TCTGAAGCCA 


TGGCCTTAGC 


18051 


TGTACCTTGG 


CCCTTATTAG 


TTAAAGCTGG 


AGCAGCTGGG 


TTGCAGGGCA 


18101 


CCAAGTCCCT 


ATGGTGCATA 


CAGCAGGGGG 


GCCCTGGACC 


CAGCCCACAA 


18151 


AACCAATTTT 


CCCTCCTAGG 


CTTCTGGGCC 


TGCGATGAGT 


AGGGTTGCCA 


18201 


CAAAACTGTC 


TGACATGCCT 


TGGAGACATT 


TTCCCTATTG 


TCTTATTAAG 


18251 


ATTTGGCTCA 


TAGTTACTTA 


TGCAAATTTC 


TGCAGCAGGC 


TTGAATTTCT 


18301 


CCTCAGAAAA 


TGAGTTTTTC 


TTTTCTATGG 


CATCATCAGG 


TTGCAAATTT 


18351 


TTAAAACTTT 


TATGCTCTGC 


TTCCCTTTTA 


CAATTAAGTT 


CCAATTCCAA 


18401 


ACCATATCTT 


TCTGGATACA 


TAAAACTGAA 


TGC TTATAAC 


AGCACCCAAA 


18451 


TCATATCCTG 


AACACTTTGC 


TTC TC AG AAA 


TATCTTCTAC 


CAGATACCCT 


18501 


AAATTATCGC 


TCTCAAGTTC 


AAAGTACCAC 


AGATCTCTAG 


GGCAGGGGCA 


18551 


AAATGCCACC 


AGTCTCTTTG 


C TAAAGC ATA 


ACAAGAGTCA 


CCTTTGCTCC 


18601 


AGTTCCCAAC 


AAGTTCCTCA 


TCTCCATCTG 


AG ACC AC CTT 


AGCCTGGATT 


18651 


TCATTGTCCA 


TATCATTATC 


AGCATGTTGG 


TCAAAGCCAT 


TCAACAAGTC 


18701 


TCTAGGAAGT 


TTCAAACTTT 


CCCACATCTT 


CCTATCTTTT 


TCTGAGGCCT 


18751 


CCAAACTGTT 


CCAACTTCTG 


CCTGTTACCC 


AGTTGCAAAG 


TTACTGCCAC 


18801 


ATTTCTGGGT 


ATCTTTACAG 


CAGTGCCCCA 


CTCCTGGTAC 


CAATTTACCA 


18851 


TATCCATTTA 


TTCTCATGCT 


GATAATAAAG 


ACATACCCAA 


GGCTGGGTAG 


18901 


TTTATAAAGA 


AAAAAGAGGT 


TTAATTGACT 


CACAGTTCAG 


CATGGTTGGC 


18951 


AAGGCCTCAG 


GAAACAGAAT 


CATGGTGGAA 


GGGAAGCAAA 


CACATCCTCC 


19001 - 


TTCACATGGT 


GGCAGGGAGA 


AGAATGAGCA 


AAACGGGGGA 


AAAACCCTTA 


19051 


TAAAATCATC 


AGATCTCATG 


AGAACTCACT 


CTCTTGAGAA 


CAGCATGAGG 


19101 


GTAACCATGT 


CCATGATTCC 


ATTACCTCCC 


AACGGGTTCC 


TCCCATGACA 


19151 


CGTGAAGATT 


ATGGGAACTA 


CTACAATTCA 


AGAGGAGATT 


TGGGTGGGGA 


19201 


CACAGCCAAA 


CCATGTCAGT 


CATGATATGA 


GAAATTATCA 


AATTAAGATG 


19251 


TAGGGAAGGT 


TTTTAAAAGA 


TTTGAGCAAC 


CACAAATGAC 


AGATATGTGC 


19301 


TATAGTAGTG 


CAAAATACCA 


TTTTGCTCTT 


ATTAAAAATA 


TAATTGTTCT 


19351 


TGATAATCTG 


AATTATAAAT 


GTCATGGATA 


ATTATGATGC 


ATTATGCTCT 


19401 


CAGCAGCTAA 


AACTTCAAGC 


AAAATACACA 


CCTAGAGAGC 


AATCAGCCTT 


19451 


AACAATAATT 


CTATAAATTT 


AATTTTCTTT 


ATTTCTGATA 


ATTACATTTT 
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19501 


AGTTGACTTC 


ATATGTGATC 


TAAATACATT 


ACCATTATTT 


TGGACTTATG 


19551 


ATGTAGCTCT 


TGAAGTACAT 


ATATGATGTA 


GCTCTTAAAG 


TACATATAGA 


19601 


AGAGCAGATA 


AAGTATCAGT 


TCACCATTTC 


TTTGTAGTTT 


GTGCTTTCAT 


19651 


GATGAATATT 


CTCATCAATG 


TACAGATTAT 


TTGCAGGAGC 


CTTTTAAATC 


19701 


CATGTGTCCA 


TTTTATGAGA 


CTTAGCTTTT 


GTCTGTATAT 


AATGTGTTTA 


19751 


TTCAGTGTGC 


ATGGATTAAT 


TTGAGAGAGC 


ACAGGTATGG 


GTATCTTTAC 


19801 


AGCAGTGCCC 


CACTCCTGGC 


ACCAATTTAC 


TGTATTAGTT 


TATTCTCATG 


19851 


CTACTAATAA 


AGACTATATA 


TCACAATAAA 


CTGAGAACCA 


GCTGGTAAAT 


19901 


GAGAGAACTG 


TGGTCCACCT 


TTTCATTGTG 


GAGTTCTCAT 


TTTCCTTAGC 


19951 


TTATGCTGCT 


TATTCAACAC 


TATTTCTGCA 


TAATCTAATG 


CATTCACTAA 


20001 


ATGAAGGTGC 


TGTGTTAGCC 


TCCACATGAT 


ATTAATACAG 


CCTATTTAAT 


20051 


TTATCCTTCT 


TTAGATTAAA 


AATAAATAAG 


TAGTCATGTG 


CCACAGAATG 


20101 


ACACTTCAGT 


CATTTGGTCA 


TTGAAGGACC 


ACATCTATTA 


CTGTGGTCCA 


20151 


ATAAGATTAT 


AATAACATAT 


TTTTCCTGTA 


CATTTTCATT 


GTTCTGATAT 


20201 


GTTTTGATAC 


ATAAATGCTT 


ACCATCGTGT 


TAGAGTTGCC 


TGCAGTATTC 


20251 


AGTACAGTAA 


CATGCTGTAC 


ACCTAGGAGC 


AACAGGCTAT 


ACCACATACC 


20301 


TTAGGTGTAT 


AGTTAGGTTA 


TACCATCTAG 


GTTTGTATAA 


GTACACTCTA 


20351 


TGATGTTCTC 


ACAATGAACA 


AAATCACCTA 


ATGATGCATT 


TCTCAAAACA 


20401 


TGTCCCTGTC 


ATTAATACAG 


TATGTAACAA 


TACAGTTAGT 


ACAATATGTA 


20451 


ATACATGACT 


ATATTCAGAA 


TTTTAGCTAT 


TTCTCTTATA 


TTTCAAATGG 


20501 


ATTTTCTTAT 


GCACTGTGTG 


GCACGGGCAT 


TTCATTTTAG 


TAACCACAGT 


20551 


CTGGGAAAGG 


AGAAGTCTTT 


GAAGGATGTT 


GAGCAAGGTT 


ATGACATGGC 


20601 


CAGATGTGAA 


TTTTTGATCA 


GTGACTCCAT 


GTTAGCAGAT 


AAAGTTGTAT 


20651 


TGGGAAAGAT 


CAAAAGCATG 


AAGGCCAGAT 


AAGAGGATAC 


TGTATGTTAT 


20701 


CATGGATGGA 


AATGTGAGGG 


ATGGCAGGAG 


AGATGCTATG 


ATTGAATGAA 


20751 


TCTCAATATT 


CTTGGTGATC 


AAAGAATAAT 


GAGACTCATC 


CAATAAGACT 


20801 


CTGTGAATGA 


TTGAATGTAG 


TTCCTAAGCT 


AGGAGGAAGA 


ATGAGGAATG 


20851 


ATTTTCTGGT 


TCCTGACTAC 


AGCACAAGTT 


TTTGATTTTT 


AGAACAAAGA 


20901 


AT AAATTTG T 
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AAGGATAAAA 


20951 


AAGTCAGCTG 


TAATATTATT 


CTTTCCTGAT 


ACCATGCAGT 


ATTTGTATCA 


21001 


GTGATCTTAT 


TCATTCCACA 


CACATTCTTC 


TTGAACCTGG 


ACACTGCTCT 


21051 


AGACACTGAT 


TCTTTCCAAA 


TATCAGATAA 


GGTTATTCTT 


ACGTAGACCC 


21101 


TCAGTTCATA 


TAAATATGAT 


TTTCCCAAAA 


TGTGAAATAA 


GTG AC TTTTC 


21151 


ATAAGATATT 


TTTTAAAAGA 


ATGTCTTAAT 


AATAAATTGT 


GAATGTTGCA 


21201 


TGGAAATGTA 


GGTGACTTGC 


ATTGTGCATC 


CTGTGTTTGA 


TTCACTGCTC 
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21251 


TTGCATGTCT 


TGCCTTTAGC 


TGGGATGACA 


GCAGTTCAGT 


GAGCAGTGGT 


21301 


CTCAGTGACA 


CCCTTGATAA 


CATCAGCACT 


GATGACCTGA 


ACACCACATC 


21351 


CTCTGTCAGC 


TCTTACTCCA 


ACATCACCGT 


CCCCTCTAGG 


AAGAATACTC 


21401 


AGGTGAGAAT 


TACCACCTTT 


CTTTTTCCAG 


TGTTTCTGCC 


AGCTTTTTCC 


21451 


CCAAAATTAC 


TTAATATTAG 


ATTAAGGTAT 


AGCACAAGCC 


CTTAATCCAA 


21501 


AATTATTACA 


GAAACTGGAA 


AATGCAGAGA 


TAATAAGGAC 


TCCCTTTGCC 


21551 


ACTCCTGAAC 


CCTG AAG CAT 


CTTTCATCTT 


AGTCTTTCCT 


AAAGCCACAA 


21601 


CCCTTAGGAG 


GAGCAACAAT 


GTGCACTGCA 


GCCAATTTTG 


AATAAACAGA 


21651 


AGCAGCTTAT 


ATATATATAT 


ATATATATAT 


ATATATATAT 


ATATATGATA 


21701 


TACATTACAT 


ATTTATATAT 


ATGTAATATA 


TGTGCCATAT 


AGCCTGGTGG 


21751 


TATAGTTATC 


TATACAAATA 


TATTTATTTA 


TTGTTAATAT 


ATAGAGTATA 


21801 


TAAATATCTA 


TTTATATAAT 


AGATATTTAT 


ATATATTAAA 


TATCTATTTA 


21851 


TATAATAGAT 


ATTTATATAT 


ATTAAATATA 


TAAAAATATA 


TAACATATAA 


21901 


TAGATATATA 


TTTTATATAT 


TATATAAATA 


TATATTTATA 


TATTTAATAT 


21951 


ATTAATGATG 


AATTAC TATA 


TTTGTATAGA 


TAACTACACC 


ACCAAGCTAT 


22001 


ATGGTGTGTA 


TATATTAATA 


TATAATGTAT 


AATTCTATAT 


TAATATAATA 


22051 


GTAACATATC 


AATACTTAAT 


ATAATATATA 


TTCAATTGAT 


TACAATCTAA 


22101 


TTCAGAAAGA 


TTTATGTTGC 


CATATCTCTC 


CTTACAATAT 


CGATATGTTT 


22151 


GTTTAAAAAT 


CCAGCAATTA 


TTTTC ATAGT 


CTAATTTTAG 


ATAGTTCTTG 


22201 


ATTAATTTTA 


TATGATCTCT 


GAAATATATC 


ACTGGATCTG 


TTGTGAATGA 


22251 


TAAATCAAAA 


ATGAAAAATG 


GACATTACAT 


CATTAAGTTC 


TAGCTTGTCT 


22301 


TACTACTTCT 


TATGACATTT 


GATATAGAAA 


ATTTCTACCT 


TTCTGTAGCG 


22351 


TTTAATTGGT 


G TTTTC TGC A 


TGTATTTATT 


CTGAAATTCT 


CTAATATCTG 


22401 


CAAGTGGGAA 


TTATGTGGCT 


AAAATTAATA 


AAATGTAAGT 


GAAGGTAAAT 


22451 


CAAAATAGAA 


TCTTTGGATT 


TATCCAGTTA 


TCTGAAAGTA 


CATTTCATTG 


22501 


CCTTAATTCA 


CACTTTATAA 


ATTTTTCTAC 


ATAAAGTTTT 


TCTGTAATAT 


22551 - 


TTGTCTTTAT 


AGCTGAGGAC 


AGATTCAGAG 


AAACGCTCCA 


CCACAGACGA 


22601 


GACCTGGGAT 


AGTCCTGAGG 


AACTGAAAAA 


ACCAGAAGAA 


GATTTTGACA 


22651 


GCCATGGGGA 


TGCTGGTGGC 


AAGTGGAAGA 


CTGTGTCCTC 


TGGACTTCCT 


22701 


GAAGACCCCG 


AGAAGGCAGG 


GCAGAAAGCT 


TCCCTGTCTG 


TTTCACAGAC 


22751 


AGGTTCCTGG 


AGAAGAGGCA 


TGTCTGCCCA 


AGGAGGGGCG 


CCATCTAGGC 


22801 


AGAAAGCTGG 


AACAAGTGCA 


CTCAAAACAC 


CCGGTAGGCT 


TGTCGTTTGC 


22851 


CAGCTGTTAT 


GCAAAAGTGC 


TTTACTTTAT 


TGTTTCCATT 


CAATCTTTGT 


22901 


TTTCTCTAAC 


AATAGCATTT 


CTAAAATACC 


AAATTCTTAT 


CCATATTAAA 


22951 


CATGGAGTCA 


AATAGTTAAA 


TAGTTTTTCT 


GTCTACGTTT 


CACAAACTCG 


23001 


TCATAGAAGC 


CCAAGTAGGG 


CCTATATCTA 


GGCATTCTCT 


GGAAAGCCTC 
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23051 


CTCATAAACT 


AGGGGTACTG 


GATGCCTTAC 


CTTGCCAGAG 


TTATTTCAGG 


23101 


TAATGGGGAA 


ATAAGATTAG 


GTTGCTAAAG 


CAACAGTTAA 


GTTTTTTTGT 


23151 


TTTTGTTCTG 


CGTTCTTAAT 


GAAAGTTTGG 


AATTTTTACA 


CTAAATATGC 


23201 


CACTGAATTG 


CACTACAGAC 


TCTGAGAGGA 


ACAAGCAATG 


ACACTAATCA 


23251 


ATTGGAATGC 


TGGAGATTTG 


AAATATTGTC 


TGTGTATTAG 


ACTTCATGAA 


23301 


AGAAGAGAAT 


GAAATAGTTC 


TTCAAAATTG 


TGCCATACTT 


TTTTTAAAAA 


23351 


GACTCTCCCC 


GTATTTTTAA 


AATAATGCCT 


AATTATAAAT 


AG.TGCCACCT 


23401 


GAAGCACTAA 


TTAACAGGGT 


ACTCCAAATA 


TAATCATCTC 


ACAGATATTC 


23451 


AAATGAATTC 


TTTTTCTAGT 


AATTAGCTTG 


ATAGGGTTAA 


GTGTTACCTT 


23501 


TTTAAAAAGA 


GTTGCAAAAT 


ATAAGACATT 


AACAAATAGC 


AAAACATATG 


23551 


TTTTCATTTT 


ATCTCTTCCA 


TCTCTCATAA 


TGTTTCTTCT 


GACAGCCAAA 


23601 


TTTTTGTAGC 


TATGCACTCA 


GTCCTCTCAA 


TATATGAGAT 


TTTTGATCTA 


23651 


AGCCAATACA 


TTTAGGAAGG 


GAAATAATAT 


AAAGAAGCAT 


TCACATTTTA 


23701 


CACATTGTTT 


CACGAAGTGT 


GGTGATATCA 


AAC TCT AC AG 


GCACATATAT 


23751 


TTGTGTA'TTT 


CTCCTTAATT 


AGGGAAAACC 


GATGATGCCA 


AAGCTTCTGA 


23801 


GAAAGGAAAA 


GCTCCCCTAA 


AAGGATCATC 


TCTACAAAGA 


TCTCCTTCAG 


23851 


ATGCAGGAAA 


AAGCAGTGGA 


GATGAAGGGA 


AAAAGCCCCC 


CTCAGGCATT 


23901 


GGAAGATCGA 


CTGCCACCAG 


CTCCTTTGGC 


TTTAAGAAAC 


CAAGTGGAGT 


23951 


AGGGTCATCT 


GCCATGATCA 


CCAGCAGTGG 


AGCAACCATA 


ACAAGTGGCT 


24001 


CTGCAACACT 


GGGTAAAATT 


CCAAAATCTG 


CTGCCATTGG 


CGGGAAGTCA 


24051 


AATGCAGGGA 


GAAAAACCAG 


TTTGGACGGT 


TCACAGAATC 


AGGATGATGT 


24101 


TGTGCTGCAT 


GTTAGCTCAA 


AGACTACCCT 


ACAATATCGC 


AGCTTGCCCC 


24151 


GCCCTTCAAA 


ATCCAGCACC 


AGTGGCATTC 


CTGGCCGAGG 


AGGCCACAGA 


24201 


TCCAGTACCA 


GCAGTATTGA 


TTCCAACGTC 


AGCAGCAAGT 


CTGCTGGGGC 


24251 


CACCACCTCG 


AAACTGAGAG 


AACCAACTAA 


AATTGGGTCA 


GGGCGCTCGA 


24301 


GTCCTGTCAC 


CGTCAACCAA 


ACAGACAAGG 


AAAAGGAAAA 


AGTAGCAGTC 


24351 


TCAGATTCAG 


AAAGTGTTTC 


TTTGTCAGGT 


TCCCCCAAAT 


CCAGCCCCAC 


24401 


CTCTGCCAGC 


GCCTGTGGTG 


CACAAGGTCT 


CAGGCAGCCA 


GGATCCAAGT 


24451 


ATCCAGATAT 


TGCCTCACCC 


ACATTTCGAA 


GGTAAGGATG 


TATAAAATGA 


24501 


TGCTGGAAAA 


ATATAAAGGA 


TAAATATGTG 


TTAGACACAT 


ACATTACATA 


24551 


TAAATGTGTG 


TATATATATA 


TTTTAAATAT 


GTATAAGGTA 


TATAATATAT 


24601 


ATATCTTAGA 


ATTCTTTAAA 


GTACACAGTG 


AGCTCTATGA 


AGCTTATCAT 


24651 


ATAAACAGCT 


AGCAAAAAAA 


ATAGTTCTCA 


TTTTGAGAAA 


CAGTCAAACT 


24701 


TCAAAGTTTC 


ACTGTCATTG 


TGATACTAGC 


AACACAAACA 


TCTAAGAGAC 


24751 


TTAAAAGCTG 


ATGGTAATAC 


CTAAGTGTAG 


TGATAAGGCA 


AAGTAATAGC 
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24801 


TTGTAAAATT 


TCTATAGATT 


TCCATTCCTC 


CTTTTCACAT 


TAAAAATTAA 


24851 


AACCAAATAG 


GTTTTCATGA 


CTTTTGGCAT 


TCATTTCCAG 


TGTCATTTTC 


24901 


TTGCTGGCTC 


TTAATGAGTT 


GGTGATCATA 


AATGTAGATG 


AAGTTGTTTT 


24951 


CCTTGTAACA 


GATTCCATTG 


GACAGATTTA 


TACAGTGTCA 


TATCTTGACA 


25001 


CATTAAAGAC 


AATCAAGATA 


TGACATAATT 


TGAAACTATT 


CCAGTGTTTG 


250.51 


GTACAGTATC 


ACAACTGAAG 


AGTGGGCTAA 


GCTTTCTAAC 


TCTTCATCTG 


25101 


CTTTCTTTGA 


CATGACTCTG 


GTAAGGATCA 


TGACTTGGTT 


TCTGTTCCTG 


25151 


GATTGTTTTT 


GGTGTTAAAT 


ATGTGAAGTT 


CTGCTCTAAG 


ATATCACTGT 


25201 


TTTTAAATAC 


CCATGTGTTT 


TTAAGTGGTA 


GGAAAATAAA 


TGCAGTTAAA 


25251 


AATTGGGGAC 


AAATATCTAA 


ACCTCTCTGA 


GTCTGTTTTC 


TCATCTGCAA 


25301 


AATGGTAGAG 


TGTGGTTTAT 


AGTTCATTAT 


GGGTTCAATA 


TTTTTAATGT 


25351 


TTGTTTTTAT 


TCTGTTGACT 


AAACCCAGAA 


CTTTGATATC 


TTGGAAAGGA 


25401 


AAGATTTTGA 


AACATTTATT 


TTACAATAAA 


GCAATTTCAG 


ATACCTGATT 


25451 


GTTTGAAAAA 


CCTAAAGGCT 


TTATTCCTCC 


GTAGTAATAT 


TAATGCTGCA 


25501 


GAACTGUCTT 


TTTAAAATAC 


TGATTCTCAT 


TGGGAAGAAT 


GAATTATGGC 


25551 


GTATAGGGAG 


AGTAAATATT 


TCTGTTTCTT 


AAGTAAAAGC 


CAATAGTGCC 


25601 


CTCCTGTGGC 


CCATTACCTA 


TGAAACAATT 


TCTCATATTC 


GTCATAAAAT 


25651 


ATTTCACTGT 


AGGAAATATG 


GATTTCATTG 


CAACTCAATT 


AGTAATCATT 


25701 


ATGC C ATTAC 


TTCATATCAT 


TGTATTTCCA 


TATTTACATA 


AATTTGATTC 


25751 


T AC CATC TG C 


TTCATTTACA 


AAACTAAAAT 


GTTTTCTGAA 


CTAAACTCCA 


25801 


AAATCTAACA 


GCACCAGCTC 


TGTTTCAAAT 


CACTATTAAA 


AAATG TATTT 


25851 


G AATAGC AC T 


GGCAACTGAC 


ATAAAACCCT 


TTGGCCTCTG 


CTGGGGAAAA 


25901 


TACAGACAAA 


CTGACTTGTT 


GCCGACAATA 


TCAATATTGT 


TTCCAACCAA 


25951 


CTGCTCCCTG 


ACAGTGACTC 


AGACCACCAG 


ATACTCAACA 


CAACTCCCTA 


26001 


AACTTGCTTT 


AAGCGTTCCA 


TCTAGATTTT 


GAATAAACTG 


TTTAAAAATT 


26051 


TAAAAATAAA 


AAAAAAAGAG 


AAGAGCTCAT 


TTAAGTGTTG 


TCTATCGAAT 


26101 " 


GCGTAGAAGT 


TGTTTCATTA 


TAATGGTTCT 


GTAAATAGGT 


AACAGCAAGT 


26151 


ATGGTCAAAC 


TACTGACTTT 


GAGTGAAAGT 


CTCATGATCA 


CTTAAATTAT 


26201 


GAAAACCAGG 


GGTTTTCATG 


TTTGACTTAC 


TTTTGTTCCA 


CCCACTTCCC 


26251 


CTCTTTCCCT 


AGTAGCAGCT 


C AG TACTG AC 
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26301 


TTCTGCACTT 


GATAAAGAAG 


TCCAAGCTTA 


TAAAAGTTCA 


TTAACATAGA 


26351 


GACAGGAAGT 


GCTTTGTAGT 


TCAGTACATC 


AAAGCACACT 


TGGCTCTGTG 


26401 


TACTGTAACC 


CGAAATATTA 


AATGTGGATA 


TTAGCTTCTT 


GGAACAACTG 


26451' 


AAGTTGTTAT 


TTG TTTTTCT 


TTTAGGTTGT 


TTGGTGCCAA 


GGCAGGTGGC 


26501 


AAATCTGCCT 


CTGCACCTAA 


TACTGAGGGT 


GTGAAATCTT 


CCTCAGTAAT 


26551 


GCCCAGCCCT 


AGTACCACAT 


TAGCGCGGCA 


AGGCAGTCTG 


GAGTCACCGT 
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26601 


CGTCCGGTAC 


GGGC AG C ATG 


GGC AG TGCTG 


GTGGGCTAAG 


CGGCAGCAGC 


26651 


AGCCCTCTCT 


TC AATAAAC C 


C TC AG AC TTA 


ACTACAGATG 


TTATAAGCTT 


26701 


AAGTCACTCG 


TTGGCCTCCA 


GCCCAGCATC 


GGTTCACTCT 


TTCACATCAG 


26751 


GTGGTCTCGT 


GTGGGCTGCC 


AATATGAGCA 


GTTCCTCTGC 


AGGCAGCAAG 


26801 


GAT AC TCCGA 


GCTACCAGTC 


CATGACTAGC 


CTCCACACGA 


GCTCTGAGTC 


26851 


CATTGACCTC 


CCCCTCAGCC 


ATCATGGCTC 


CTTGTCTGGA 


CTGACCACAG 


26901 


GCACTCACGA 


GGTCCAGAGC 


CTGCTCATGA 


GAACGGGTAG 


TGTGAGATCT 


26951 


ACTCTCTCAG 


AAAGGTGAGC 


TTTCCTGGAG 


GCATTGATAA 


CATCTTCCCC 


27001 


CTCTTCCCTG 


CACTATGCCT 


AACCCCCACC 


CCATTAAATT 


CCCTTGATTT 


27051 


CACTGTGAGT 


GCCCCGGTGC 


AAAAAGATGT 


AAGACTGATG 


AAACCGGGCC 


27101 


TTTCATTTGC 


TCTCATTACC 


AAATTTACAG 


AGGAATAGAA 


TCATTAAAGG 


27151 


TAGGGTGAGT 


GGATAATTTT 


GTTAATATGA 


ATGCATACAT 


TTATACCCAG 


27201 


TAGGCAATGT 


GAATAAAATT 


CAAGGAATGT 


ATTTAGATAT 


TGAATGAGGT 


27251 


CTCCTGAAGA 


CATTTTAATG 


ATTTGGCTTA 


AGCTTCAGAA 


CAACACTAGC 


27301 


TCCTTAT&AT 


GACTTAAGCA 


TTTTGAAAGA 


CCAAATTGAA 


ATTATTCTAT 


27351 


AGTTATGCTC 


AGAGCAATAT 


GTTAAATTTG 


TTCCATTTGT 


ACTTCTATGA 


27401 


AAAAAT AG C A 


GATGGATTGC 


TGGGAAATCC 


TAGTTGGCCT 


GGTTAAAAAA 


27451 


AAAAAAAAAA 


TCAATTGTCA 


GCCATGAATC 


ATTAGAGAAA 


ATTATAGTGT 


27501 


CAGTGCCATT 


TTCAATAGAC 


TGCTTAAAAA 


GTAATCATAT 


TACAAAGTGT 


27551 


TTCTCATTGG 


CTTTATATAT 


ATATATAAAC 


TTAAAGTAGA 


GGACATAGCA 


27601 


AGGCATTTCT 


TAC CT AAT AT 


GCTTACTGTG 


AAGCATCCCT 


TTTGAGCAAA 


27651 


ATCACTCTAA 


ATTTTCTCCT 


CAAAGTGATC 


CTCTCTTGAT 


TATACTGTAC 


27701 


TGAC TC TTAC 


CACCAGGAAA 


ATGTCTTAAA 


ACCACTTCTT 


TTTCCTGATA 


277 51 


AATGCAATGC 


TATTTGTCTC 


TTGACATAAG 


TAAAGCTTTA 


AACATGGTCT 


27 801 


TGGCCACATG 


TGGAAAGAAA 


TACTGGTCAC 


GTAAAATACC 


TGATATATCT 


27851 


TTCTATGTCT 


TCCCCTGTTT 


TTTTTATTTT 


TTTTTTATTT 


TTATTTTTTA 


27901 


ACTCTGATAT 


TGATGATGGC 


ATTTATTTTC 


TAGACCTTCA 


GCCTTACTCC 


27951 


CGGAATGATA 


TTTTTAAACA 


TCAATTAAAG 


CCCTTAGCTA 


GACACTCTCT 


28001 


GCATTACGCC 


AGTTTCCCCT 


TAATGTAGGA 


TGTCCCAATT 


TGAAATTCCC 


28051 


CATTTTCTCT 


TGACTTTGTA 


AAATACAAAA 


CCCAGAGCAA 


AACATTGCTT 


28101 


CTTTCCCTCT 


TTACTTCCTA 


CTTGCCTAAC 


AATGAGACAG 


GGACAGCCGT 


28151 


GCAAATGGGG 


CTTTCCGATG 


ATAAAGTAAT 


TTTAAC AC TA 


ACTAAAATAT 


28201 


TGGTGTTTCC 


TATGGTGGGC 


TGCTAATTAC 


AAAATACATT 


TTTCCTCCTA 


28251 


AAGAAAAAAA 


CTGGGCCAAG 


GCAAACAGCT 


CAGTGATAGC 


AAATAAAATG 


28301 


TAACCATTTC 


C CT ATGGTTT 


TGCTG TTATA 


TGCTATTATA 


GACAGCATAC 
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28351 GTAAAGACCA GTAAGGGTTC ATTTTTCCAC CTAAAATGTC GGGCTTCCTG 
28401 TAAAATCTTT GATTCTAGTT TCAGCACTTC TAAGGTAAAT GGGCATCTTC 
2 8451 ACATGTCATT TATAAAACTT CTAATGAATG AATTATATTA AAATAGATAA 
28501 ACAACCTATA GTTTTAATGA ATGTATCCTA GATTGTATGC TCATATGTAA 
2 8551 GGATTCTAAA TATCAACTTG ATAACCAAAC CAAACATAGT GCAAATAGGT 
28601 TATCATTTAT TAACCACAAC CACCTTCCAC AAAACTGGTC ATTTTTTAAT 
28651 TATTAAGATA ATCTGCAACA AGTTGGCCAT TTAGCCATCA GCCTATTTCT 
28701 TCAGCATTTA GACATTAATC CCAGATTCAG AAATAAAGTC AAGTAACTAT 
2 8751 TTATAACCAA GTAACATTCA AATCAAAACT AGATGAAAGA TTGGTTAGTT 
2 8801 GCATAGCTAT AACCAAAATG C AG TTTTAAT ATTTTACTCT AATCTATATT 
2 8851 TTAACTGAAG TCAATAAAAT TTTCACTATG GAAATACACT AGAAAATATG 
2 8901 CAATTTCTTA TTCTTTTTAA GCAGATTTAT TTATTGTACA TGTTCAGTCT 
2 89 51 TTGAAATAGG CCAATTTTAT TTATGTTATG TTATGTTATT TATTTGTTTT 
29 001 G AAATGG AG C CTCACTCTGT CGCTCAGGCT GGAGGGCAGT GGTGCCATCT 
29051 CAGCTCATTG CGTCCTCTGC TACCCGAGTT CAAGCAATTC TCATGCCTCA 
29101 G C C AC C TG AG TAGCTGGGGT TATAGGAGCG GACCACCATG CTGGGCTAAT 
29151 TTTTGTATTT TTTGTAGAGA TGACGTTTCA CCATGTTGGC CAGGCTGGTC 
292 01 TCGAACTCCT GACTTCAAGC GATCTACCCT CCTTGGCCTC CCAAAGTGTG 

292 51 GGGATTACAG GTGTGAGCCG TGGCACCAGC CTGAAATAGG CCAATTTTTA 

293 01 AAATGGGAGT ATTCCTACAT TAAAATGGCC AAATAAAGAC TTTTTCTAAA 

293 51 ATAAAC TTTA AACTAATTTT GGATAAATAT GTTTTGCCTT TGAGCCTTAA 

294 01 TAAAATG CAT TAATGAATAT TAAGCTGTAA AAAGTACATG TTAACTACAT 
29451 AGCTATAGTG TATAATATTA ATATTAATTA GTGCCTTCCA GTAAATTACT 
29501 AGATTAAAAT AAATTTTAAT ATAAGACACT GAGCTTTTTG TTTTCTTGAC 
29551 AATAGAACTG CAAGCAATAG CAAATTGCTC TAATCCTTTC ACGTACATTT 
29 601 AAGAAAGTTT ATGACCTATT GAAGAGAAAA GTAGATCTAG TGGGTGATAC 
29651- TGGCTTCATT ATGGTTAATT AATTGATCAG TAGAATGTCA GAAATGCTAA 
29701 GAAAACCAAA GAACTACACC AGAGAGAAAA TGTGTTAATG TAAATTTTAA 
297 51 GGCAAGTTAA TTAGCGATAT ATAATAAAGA TGTATATAAG TTCATGATTT 
29801 ACCTGTTTGT CTACAATTTT AGATGATTTT TTGATACTCA TATTTAAATC 

2 9 851 GGTAGCTTTT CCTATAGATT TTAATTTTTG TTTAAATTCC TCTTCGTTAA 
29901 ATTAAATAAA ATAATAAAAT ACACTTTTTA ACAGTTTTCT CTTCTGCAGC 
299 51 TGCTCTAGGT CATTGGTGGC CATTGAGCCA TAACTAGTCT ATATTTGTTT 

3 0001 TGGGTTTTGT TTCATGTGTC TGACTCAACT AAATTTTTAA ATAATTTGTA 
30051 GTAACCAACT TTGCAAATTC TGGGTTTGTC TTTAAATGTC AG ATC TGGC A 
3 0101 ACGCTGCCTT GACATTTCTG CCTAGAAACT ATTGGCTCTA GGCAGTCAGT 

Fig. 2 ( (cont'd 16) 
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3 0151 GTCTGTCTGC TTCAGACTGT TGACTGAAAT CCCCATTCGT TTTCATGCCC 
3 0201 TATCTGGCCC TTGCTGGCAT ATGAGTTTGC AACCTTTGGT GATTTGCAGA 
3 0251 AATTGTCTAT GTTAGAAAAT CATTAATATC TAGATTCAAA CATATTTCTA 
3 0301 AATAAAGCTT TAAATTATTA TGGTAACTTT AAATGTATTT ATTCTAATTT 
3 0351 TTTTCATTAA ATTGCTCTTC ATCATATAAA TATATAATTT TTATACAACT 
3 0401 GGATGAGTTT GGCAGAAGAA TACCAACTTT TCATATTCTT TGTGGCATTA 
3 0451 AACTTTAACT TGTACACATG GAAATAAATA ATCCTTAAAA TGACTTATGA 
3 0501 CCACATAAAT GCCTTAGCAC ATGTGG TTC A TATTTGGAGA TTTCTCATAT 
30551 TTGTTCAATA TAATTTATTT TGTTTGTTTA TCCACAGTAC TTAAGAAAAC 
30601 TTCTATAGTC AACATATATA CTGTAACTGG CCTCTACACA GTATAAGCAA 
3 0651 TTACCTTACA TGGCTATTAC CGATAAAGTT AAAGTTGTAT AAAGCCTTTG 
3 07 01 GATGCTTTTG ATTTCAGTGC TAAATAATGG AGTACACATA GAAGAAAACA 
3 0751 TTTTAGCTTT GGTTTGAGTG ATCAAATTTT AGGTCAGCCT TTTTACATTC 
3 0801 ATGTTATATC ATCCCCATTA TGCGTATCCT GTGTATTTAA TTTTGATCAT 
30851 TTGATGTCCT AAAGGAAGAA AGCTATAATT CTGCAATTTT AATTAATTTT 
3 0901 ACACTTTGCT TATCCACATG CCAGAGATTA TAAAAGAAAT CCCTAAACTT 
3 0951 GTCCCACTTA GTTGTTGATA TCCTCTTCCT GTATTTTTAG AGAGGCCATT 
31001 TCTTATTTTC TCTAGACATA GCTTTTCATT CCTTCTTGTT ACCAATTGTG 
31051 AATTCCTTAA AATAGAGATG ATAAAATTTA TAGCCTTTTA AATACCTAAT 
31101 TTATGATTTC TAAAAGATGG TATAGCTTAA TTTCATTAAA ATATTCAAAT 
31151 AAATGATACT AGAATCAATT AAGTTTTAAG CAAACATTCA TATATCTTTC 

312 01 TTCACATGTG TAAATGGGAA ATAAACATGC CTTTTTATTA AAAA.TAATTT 
31251 GAAGACAAAA GATAAGTATT AAACAACGTT TTATACCATC TCTGTCAATT 

313 01 GGAAGTTGTC ACTCTAACTT AGCCAGAGCA GATCTATCTC ATTTTGCATG 
313 51 TGATATCATA GCAAAAGTCT AATCAGTTGC ATAGGGAAGG AAAAACTAAG 
31401 ATAGTATTTA ATCAATAGGA TTCAGAGGAA AATTATGCTA ATGTGATTTA 
31451 ATCTATTTTC TAG T AATC C T ATCACTAAAC TGTCATTGAA TTGTACTGCA 
31501 TTAGAAAGGA ACTCAAATAT GTGTGACGGC AATGGACATC TTGTCACCTT 
315 51 TAGTTGGCCT TTTTCAATGA GTTAAGCATT ATATGTGTGT TACCAAAAAA 
31601 TTATTTTTTA TAG TTC AG AG AACCATTTTT GTTGGATGTG TAATTTGGAA 
31651 GTTTTGTTTA CATTATGTCC TTAGGGGTTT TCTTTGTTTT AACAGCATGC 
317 01 AGCTTGACAG AAATACACTA CCCAAAAAGG GACTAAGGTA TATATTCCTC 
31751 TCAGCACAAT TGCTACCTCT CTGTTGTTAT GTAAACTTTG TGTGCTGTCT 
31801 CTCTTCCTTC TTTGTTTGTT TGCAATGTAG CACATGACAT TGAGGACGAA 
31851 ATCACTTTTA ATTTTGATGG TTTCTCTGGC CCGAACAGTT GGTGAGATAG 

Fig. 2 (cont'd 17) 



31901 


CCCCTTAGGT 


AGAGATACTA 


31951 


TAAATTCCAA 


TGTGAATATC 


32001 


AAACAAACAA 


AACAAAAACA 


32051 


TTTTTTGGGG 


GCAGCAACTT 


32101 


TAAAATAAAA 


TTAACTCTTC 


32151 


CAAATATTCA 


AAACCCATGC 


32201 


AGGTTGAAGG 


ATGCTACAAT 


32251 


ACAGGACAAA 


TGCTGTAATT 


32301 


AAAATCTCAA 


AATTGTTTTA 


32351 


ATCAATTTAA 


ATCTGAATTA 


32401 


TTATTCCATC 


CATAATAAAA 


32451 


TTTTACTGTG 


TACTTTTATC 


32501 


AGGTATAACA 


CTGTATGTGT 


32551 


ATTT AC TT AC 


AAAAAGAAAA 


32601 


TAATGGCpAA 


ACTAGGTCTT 


32651 


CAGCAATAGC 


AAATAGCTGG 


32701 


AAATTTATGT 


TCCAGTAAAG 


32751 


CTAGGCATGA 


TGTCAAGTGG 


32801 


AAAAAAAAAA 


AAAAACAAAA 


32851 


CTTCTGTGAC 


CATCCTTCAT 


32901 


CATTTGTAGT 


CCATGTTTTC 


32951 


TTTTGAGTTC 


TCAATTAACT 


33001 


ACTGCATATC 


TGATAGAAGC 


33051 




CTATGTGATT 


33101 


ACTGAGCAAT 


TATAATTAGA 


33151 


TAGTAACCTC 


CTCTGAATTG 


33201 - 


TAGATGTGCT 


TAAGTGTCAT 


33251 


ATCTCGGCAG 


GCCAACCAAG 


33301 


CTACTGGAGG 


GCTTCAGGAC 


333 51 






33401 


TAAAATATTC 


TAAAATATTG 


33451 


TCACAGAAAC 


CCTGGAATCT 


33501 


GGAGGGATAA 


AAGCACTTTA 


33551 


TCTTCACAGA 


AAAATCTCCA 


33601 


TGGTTAACAT 


AGATTAATAC 


33651 


GCTCAGAAAG 


TTCACTTGAC 
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GTAGAGATTG AGGCTGTCTC TCAAATTAAA 
ACTATTTTGA AGAAATAATA CTAAACAAAC 
AACAAACAAA AAACTTGTCC CAGGCATTAC 
TGGTAGAATG CAGAACTCAC TTCAACAAAT 
TAACTTTTGC CTATTAGAGT CATATGCATG 
AGTCTACAGA TGTGGGCAGT TAATGTTGAT 
CTGAATCAAA GAAAACATAT TTTCATCATC 
AAGGTGTGAT TTTTATAGAA TCCTTTTGAT 
ATTTCTATTT TGCAGGGGTA CTGCTATCAG 
ATCTAATATC ATTTAATAAT CTCAAAATAA 
AATAAAATAA AAATTTAACT TATGGCCATC 
TGAGGAAGAG ATAGAATGAT CTACTAATAG 
ATGAAAAGTT GGCTAATTTT GGTGCTAAGA 
AGAATATACT TAGTTTGGTG AAACACTGAA 
TCTCCATTAT TTTTTTTCTC TCCAATTTTT 
CAATTATTCC ATGTTAATAT TTTGATC C AG 
CGAGCACATC TCCCTCCTTA TTTTTGTAAT 
CAGTTTAACA AAAGAACTGT TTTTCCTTTA 
GCTGCCAATA TGTATTCCAT TTCCCTATGC 
TTCCCTTGGC CCTGGCCCAC CACTGTCCTC 
ACCCTCTTTA CATCCTTTCT TGCCCTGTGC 
TGGCTGTCTG CTCATTGCTT ATGATTTCCA 
ATAATTTTCT CCTCAAAACC CTTTATCTTA 
CAAACAGATG GCGTAAGATC ATCTGGAAGA 
TTCAATCTGT TTGAAATTGT TCATTCTGAA 
TTTTCCTGTC CTGGCATTGC CTTGCCCTTG 
AGCTGTGCTG TTTTGCAGAT ATACCCCATC 
AAGAGGGCAA AGAGTGGTTG CGTTCTCATT 
ACTGGCAACC AGTCACCTCT GGTTTCCCCT 
AGCTGGAAAA TACCACTTTT CTAACTTGGG 
ATTTTGTTTT GTTTCTTTCA CCACCCACTC 
CTCCATAACA CAACACGTTT TCATTTAAAG 
ACAGTACCTT TCATTTGTGT CATTGTTTAC 
AACATTATGC TATTTATTGC TCATGACAAA 
TGTGGTTGTT TTCTAGTCTA GGCTCCAGAG 
TTGAAAAAGT CTTACCATTA CTAAGGGTTC 

Fig. 2 (cont'd 18) 
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337 01 AAGGCAGTAA CCAGTTCAGA ACATCTGACT TTAATCCC&G GGGCCTTTCC 
3 3 751 ATTCCATTTA AGAATCCTCT TAAAAAACAG GAAGGCATCT CCTTATTTAT 

338 01 TTGTCTGAAA TATTAAAACA TCCTTAAAAC AAAATTAGTA ATCTTTTGTA 
33 8 51 GAAAATAGAA ACAATTAGGA AGAAAAAAAT ATGTAATTCC ATG AC TC AAA 

339 01 GTTAACTTCT TTTAACACTG TTAAAGTTAA AACTCCTTAA AATTCATACA 
33951 AGAATTTCTG TTAAGACAAT ACTCTGAACA TTTTCAAATA GATACAATGA 
3 4 001 AAAATAAATT ACCAACTTAG TCATTGGGTT ACTTTGTATT TAACATCATT 
3 4051 TGTATGAAAT ATAAAATCAT TTGCATAAAA TTTCATTAAA AGCACTCTGA 
3 4101 GTAACAAAAT AATTAAAGAA AACTAAACAT GCCAGATACC ATTTAATAGA 
34151 TTCAATGACT TTAAAAATAT ATTTATTTTC TATAAAGTCA CATATAAAGT 
34201 ATTTTCATTA TTTTTATGGT AAATATTTTT ATTATTAGTT TATCAGAAAA 
3 42 51 ACTTGTACAT AAAGATGAGT ATTGATACAT AATCTTATTA GAGCCAGAGA 
343 01 CGATCATTCC TTCTAGAAAA ACACATCTCT GAATTTAGGA CGGAGGACAA 
34351 TGAAACAAGA AATTTCACTT TATAATTTAC CTTTGTCAAA CTATCCCAGA 
3 44 01 GCACATCAAT TCCATCATGA AAGTACTCTT TTGACATTAT ATAAAAAATT 
3 4451 AGTAATAGAA AACACACAAT CCAAAACCTT ATATTTTCTA AACTTCAAGT 
345 01 TAATCATCAA C TTC TCTTAG ATTTTTGAAG ACCTGAAAAT AAACATAATT 
3 4551 TCAAATAACA GAACTCAAAC ACCATATACA TTTGTAATGA GGCACAACAG 
3 4 601 TCAATTTTGA GCCTTGTATT TTCCAGGTTT TAGCTGAATA ATCTTCACTG 
3 4 651 CTTTCTTAGC TTTTTGCCAG TCTAGTTTGG GGACTATTTT GCCTTACTGG 
347 01 GCCTAAACAG AGTGTAATAT TAAAATATGT TAATAAGCCA TACTGAGAAT 
347 51 AAGATAAATG CAGGTTTCTA ACTCCTTAGG GACACAAGTG GGGACAACAC 
34801 ATTCCATGAA CACAGGTGAA TGAATGCCCC TAGTTTCTCT GAGTTGGACA 
34851 ATTTCATGCG ATCATTTTTT TCTCTGAGGC CAAAGTCTCT GGTTTGATCT 
3 49 01 TCCTAGCAGC TTCCAGAACA GAAAGTGAGT TTACTTTGTC TCCATATTCT 
34951 TTTTCTCCAT GCTCGGGAAT CCCCTGCTTT CCTGATCCCA CCACAAAAAC 
3 50 01 TCCCCTGAGG ATGAAGCCTT GGCTTTCCAG GCTTCCAGGG AAGCCTCGAT 
3 5051 TCCTGGCTGG AGGTAGTTGT ACCACACTCC CAGAGGGCTA AATCCCATAA 
3 5101 ACATCATCTT CTGTCTTTGT AGATCATAGA ACTTTTTATT ATCATCCAGG 
3 5151 AAGATTTCTC TTTTGAAACA AGGCTGGAAA AACTTTATGT CAGTCCTGAC 
3 52 01 CTGCTCTTTA ATGACTGCGT AGAGGGAGAT GCCCAGCTTA TCCAACTTGG 
3 5251 GTTGCAGAGA GGACAGATCT GCAGCCCCTC TTGCCAGAGA AAACATCCTG 
3 53 01 GCACAGCCAC AATCACAACT CCATTCTTCT CCCGATAGCT CCTTTGCTTT 
3 53 51 GAAACTCATT GGTTACTTCT CCAGTG TTTT CAGGTCTATA TTCTCCAGGT 
354 01 ACTCCAGCAC CTCTTTCCAG GGCTTGGACA AAAATACATC TGTGTTGGCC 

Fig. 2 (cont'd 19) 
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35451 


AGCATCAGTG 


CCAAGGCAGC 


AGCCTCCAAG 


GGCTCCTGCA 


CCCATGGACC 


35501 


ACATCCACAC 


AGAGAAGCAC 


CTTGGGTCCT 


CAAGTGCCTC 


CCTCTTCTTC 


35551 


CCTTCTCCCA 


AACCTGAAGC 


CCAGACACTA 


AGGGGTCAAA 


CCCTCCTGGG 


35601 


CCCTGAGGGT 


TCCAAGGGCC 


TCATTACTTT 


TTCTTTTTTT 


CACTGGAAAA 


35651 


AAAATTCTAA 


TCATGCACCT 


ACAGAAGATT 


GACATTTTTC 


AGTAAGTTGG 


35701 


ACTTTCCAGC 


TTTCAGCCAG 


GACAAGACTC 


AAGGCTATGT 


CTTTTCTATT 


35751 


GCAACCCTTC 


CCACTATATT 


GAGTAGGGCT 


TTTAGCAATT 


GAAAACAATT 


35801 


ATTTTGGTCA 


TGGTTTCATA 


TAAGCTAATG 


ATTTCATATC 


AAACACCAAG 


35851 


TTTTTGTTTC 


CTAACCTATA 


TAGTGATAAG 


AGAATTTACC 


TATAATGCCA 


35901 


AAGAATGTAT 


AGC TTTTATT 


TGCTTTAAGA 


TGCAGTTGAT 


TTTTTAAAAA 


35951 


AGCGAAAAGC 


CTAACACTTT 


AACTTCAAAA 


AATGAATTTA 


AAATGTTTGT 


36001 


GTAGGTCATA 


GGAATATGAA 


AAAATTTTAT 


ACAACATCTA 


AAACACACCC 


36051 


AAATCACCTA 


AAG TGCTATA 


AGCTTGCTAA 


GTACTTCATG 


TCTCCTATCA 


36101 


ATTCTTTCAT 


TAATTGACGT 


TAATTTGATT 


AGTTGACTCC 


TTCTTCTATT 


36151 


TTTCCTCACC 


ATTATTATTC 


TGATTAAATC 


CACCTTCATT 


ATTCCTTAGG 


36201 


AACAAAAAGA 


CTCACCACTT 


AACTATGTCT 


GACATTGGTG 


AAGTCGTTTA 


36251 


AACTTAATTT 


TCTTATCTCT 


TGAATGGATA 


CATAATACCT 


AGG TTATATT 


36301 


GTAAAGAATG 


ACGGATATAG 


TGTATGTAAA 


GATGGAGAAG 


TG TGTAAG AC 


36351 


TTGACAGATT 


CTGCCAAATC 


ATTATTTTCA 


CTGGAAAGCA 


TGTCTTACAC 


36401 


GATCATAGAG 


TAGCATTCAT 


CAGATATGCC 


TGAGCTTTGT 


CTACATTTAA 


36451 


TTGAGTAGTA 


ATTCGCAACA 


CAGTAACCAC 


AGGATTTTAT 


G TAAAAG AC A 


36501 


TTCACAGATT 


GTGTTTTTGA 


AAGATTGTAT 


TTTTGAAGTA 


CAAAACTATG 


36551 


ACATTGTTAT 


CAAGGACTCA 


TTTACCACAA 


ATATCAAATA 


TTTG TG C AAA 


36601 


GATAAGTTTA 


TGCTAAGATT 


TGCATAAATT 


AAAGTTAACA 


TGGCAACTGA 


36651 


AGCTAACATG 


TCCATGGTCA 


CAATGTGTTA 


AAAAATGAAT 


GGTTCTGTAG 


36701 


CACACTTGGG 


AATG TATTTT 


ATTACATAGT 


TTTCAGAGTT 


AAAACACAAT 


36751 - 


TAATAAATGA 


AATGTGAATT 


ATACTTTTAC 


TGACAACAAA 


GCTCTCTGTA 


36801 


GAGCTTTAAT 


GTTCTAATGA 


ATTAGAAAAC 


CACTGATCAA 


ATACATCCCT 


36851 


TACATTTCAT 


TGCTATAGAA 


ACCAAGTCTG 


AAAGGTTAAG 


TTTACCTTTC 


36901 


TAGGATGTGG 


GTTTCCCCCC 


TTAATCTATT 


GTGGTTTATA 


TCAGAGATCT 


36951 


CTCAGCTGTG 


TCAGACAGGC 


CATGACTTAA 


GTGACACTGC 


CCTCTTGATT 


37001 


CTCTTCATAC 


TTTTCCAACT 


ACAATTCTTT 


CTCCTGGGGT 


TGCTCATCTT 


37051 


AACATAGCTG 


TATCATTTAT 


TGTAGACACA 


AGGTCACTTT 


TGAGAGTGAA 


37101 


TGGGACTATA 


TTAATAATTG 


TTCCAGGTAT 


TAGG TG C AAA 


CCCTGGGCAA 


37151 


TGCAATTCAT 


CCTCCATCTC 


CTCCTTATAT 


TTATGTGTTT 


ACCAAGTTGT 


37201 


TTTTCCTGTA 


GACTTTTTTT 


TATCCTAAAC 


CCTTTTTCTA 


TGTTCTCATT 



Fig. 2 (cont'd 20) 
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TV Tv TV Tv Tv ftlfTVi 

AAAATGATTG 


37801 


ATTCAAAAAC 


ATAATATTTA 


AAGAGTTTTT 


ATTGTATTTA 
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GAACCTCTGT 


TCATTGGGCA 


AGAAAATGAG 
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ATGCAATAAT 


37901 


Tv tv Tv mfrTTv Tv tv /*^m 

AAATTAAAG T 


TAC TTT ATT A 


TTAAATTTTA 


AATATATATA 


TATATACTTA 


37951 


^ti I'll 1 ■ i *pl >. Vfn fh rn 

C CTTAAATAT 


GTCCTCTTGT 


TG TCTTTTAG 


C ATC ACC C AT 


TTTTGATTTG 


38001 


ACCATTATCT 


TTTCTGAATA 


ATCAGTAAGA 


m Tv f~~y tv tv mm Tl 

TACAGGATTA 


TTATTAATGT 


38051 


TCAAAAGTTG 


C AG TATTC AT 


GTTTTCTTTA 


TTCTTTCTAC 


CAATTAAAAT 


3 8101 


GTGTTAATAT 


ATAAAATTTT 


1 AOAAA 1 111 


A prp Tv rp Tv A A A 7, 

AL 1 A 1 AAAAA 


TV m/"^ TV /"^ TV TV ^1 TV m 

ATCACAACAT 


joIjI 


A H"i A rnrp A P A A A 

ATAX 1 AG AAA 


71 mm A A O A rpP A 

All AAOA 1 LA 


z^irp A P* A A rp A TIP 1 

L 1 AL AA 1 A X G 


TCATATTTAG 


TAG AC TAC TG 


*3 O *5 f\ 1 


1 OAOL X AU X O 


PP A PAPTIJV A A 

LLALAb 1 AAA 


P»T> A rp/-'P*rnrpP , P' 
L X A 1 OG 1 1 Lb 


TGTGTCGTTC 


PP A PP A fn/" 4 /"~» rn 

CCAGCAToCT 


JOZDI 


AbLH, X AG X A 


/""AAA PP A fprnp 
OAAALLAl 1L 


P^P 1 A rnrp/-< a A P" A 

LLA1 1 LAAvjA 


A AP'AP'rpA AP"7i 

AAGAL X AAL A 


A A prn a m t\ p pm 

AAG TATAG c t 


J OD UX 


T 1 A P A I 1 A A A T«P 

X ALA 1 AAA X L 


A A A A A P T l P r P r P 
AAAAAb 1 L 1 1 


TV^P 1 A TV A A A P* 

1\j(jA 1 bAAAb 


X 1 LAI 1 1 bbb 


A A A A rp A A PPP 

AAAAX AALLL 




AA1 LOL 1 ALL 


prpmp A A TTTT 
L I 1 LAA1 111 


mm a rn/~« ?\ > TP* A 
1 1 A I bAA 1 \s A 


AAAAA TPP A A 
AAAAA 1 OO AA 


PA A T> A A APPP 

bAA X AAAOOL 


7 ft APil 


PTP rp 7\ tv P A TP 
L 1 L 1 nriun X L 


P A TTP AAA PP 


LAbbnbAbnb 


A P* A A CI A A TTT 
AbAAbAAl 1 I 


PT AAArpAPAA 

L 1AAA1 AOAA 


"3 ft 4. 


OAO/Vri/\L.rVO/i 


A PA PPrpn A rp A 
AOAOO I LA 1 A 


PTTPTTPTP A 
bJ. ILi io 1 VjA 


PPP A TPTP A T 
bLLni L 1 LA I 


AALL 1 bb IbA 


3 8501 


P APTP ATTPT 
VjAL I Lr\ I I O I 


p AT'pppTPr' a 

LA I OLL I LLA 


TP P" A TP 1 A t a a 
1 bLA 1 OA I AA 


P A A TPPPTP A 
LnA I LbL 1 LA 


P A T'rp^* > 1 Mil irprp 
OA 1 1 LA 1111 


jOjjI 


I LA 1 L 1 1\?LL 


A p A apPpfpT 1 A 
AL AAbbb 1 1A 


P 1 A rn/-'/~» a P'P' A A 

LA 1 bLAbbAA 


P" A TT A A TPTP 

LAI lAAlb 1 L 


A A PPT^/Tp/"! TV 0 

AALL 1 0 1 L AL 


^ ft ni 


TTPTA A T* A TT" 
11C 1 AA 1A1L 


P A TPT A A T A T 
LAI L 1 AA1 Al 


rppi m/^* 7\ 7v TV TIT* 

1 L 1 L 1 AAA! 1 


pp A TPP A TPP 
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X 1 X 1 OLA X AT 


38651 


GGTGATTGTT 


AAACACCTTT 


GCATAGGAAC 


AGTTTCTATG 


CTTTTGTACT 


38701 


CAAATCTTCC 


TCTACCTTGA 


ATCCTTTCCC 


ATCTTCGTGT 


TCAACCTTCA 


38751 


ATCTTCTCAG 


AATGAACTCC 


TGTCTTCTAT 


TCTTTCGGAA 


GCATAGAATC 


38801 


TCACGGTCAG 


AAGAGACCAC 


ATCTGGTTCA 


ACCCTTCATC 


TCTTATGTAA 


38851 


AATTTTATGA 


CATCTCTAGC 


TTCTTCTTTA 


AACCCACCAA 


TGACAGAAAC 


38901 


T AC T AAAATC 


TAGAAATAAC 


ACCTTTGAAA 


TT^^ TTT TTT 


AAGAGATCAA 


38951 


ATAAAATTTT 


CCTGAATCTT 


CACCTATTGT 


TCCTAGTTAT 


ATATATC C AG 
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39001 


ATTC TAC AAA 


ATAAGTCAAA 


GTTAGATTGC 


ATATGACAGC 


TCTTCATATT 


39051 


TAAAACAATA 


TAATAAACTC 


ACTAGTTAAT 


GTCTAGCTGT 


AGATGCAAAA 


39101 


GTAGAGAGTG 


ACTTGGGGTT 


ATTTAAAAAC 


CCAGTCCAGC 


CAGACACATT 


39151 


GGATCATGCC 


TGTAATACCA 


GCAGCACTCA 


GGAGGCTGGG 


GCAAGAGGAT 


39201 


CCCTTGTCCA 


GGAGTTACAG 


GCTACAGTGA 


GCTATGATCG 


TGGCACTGCA 


39251 


TACTCCAGCC 


TGGAAGACAG 


AGTGAGACCC 


TGTCTCACAA 


TAATAGTATT 


39301 


TAATAATATC 


ATAAAAACCC 


AGTCCACATT 


TATATAGGAT 


CCTGTTTTCC 


39351 


TCAAGTTACT 


ACAAATAAAT 


ATATAATCTT 


AATAAAAGGT 


TAGTGGCTTT 


39401 


GCCAAGATAG 


TGGCTTGGCT 


ATGCAAATGC 


AATTTAAGAC 


AAAGTTGGTA 


39451 


GCCCTCTTTT 


TCCTAATACA 


TTGCCATATC 


TGTTTCTCTT 


CTATTTGGAA 


39501 


ATTCTTGTGT 


GTCTCTTGGC 


TTCGAATGGA 


TCTTATAGTC 


CTTTTATTCT 


39551 


TCCATTTTTT 


AGTCATAAAA 


AAACTGAAGG 


GTAGTGATTG 


GGTTATTTGC 


39601 


CCAAAGCAGA 


TGGAAAGCAA 


AACTACCACT 


AGAAGCTCTT 


TACCAATTTG 


39651 


TGTTCCATTC 


AAAAAATTAT 


CTTTGTATGT 


CTTACATTTG 


TCTTCTACTG 


39701 


TATAGTTTTT 


CTTGTTCTAT 


TTTACATATT 


AACTTTTCTC 


CTTCTTCAGA 


39751 


CATCTGCCCT 


ACTGGCTACT 


CTTGAAATCA 


GAGACTGTGT 


CATATTTTTC 


39801 


CTTCTATTCA 


ACTACAACAT 


CTAAAAGCAG 


ATCTGTCATA 


GTTATTAACT 


39851 


TAATTGAACA 


CTCTTAAATA 


GTTAGGTGTA 


ATTTCCAATG 


CAGAAGCTAT 


39901 


CAAAAGGGTT 


TGTAAATGCA 


AACTATTCCC 


TTTAAAATCT 


ATCCTAATCC 


39951 


TCATTAATGT 


TTCATCTTGA 


TAGAGCTAAG 


TATTATGTAT 


TGAAATTGTA 


40001 


GAAGTACACT 


TCACTTGGAT 


ATCTCTGCAA 


TCATTTAGGT 


AAGAATTATA 


40051 


CAAAGCCAAA 


AAGCAAATAA 


AATATCCTCC 


TAACCCTATA 


GATACGTATA 


40101 


CTAAAATGAT 


GCACTTGCAA 


ATTTGTTTAA 


TACTTCATTA 


ATTTAAACAA 


40151 


GAGTAAATTC 


ATACTGTGAA 


CCAAGAATAG 


GGTGACTTAC 


CCCAATCTTG 


40201 


CCACCTTAAA 


CATAAACATT 


TTAAGTCTTC 


AATGTC CT AC 


AGTGTACCTA 


40251 


CTGGCTGTTG 


TCACTAATCA 


GACCGAAATG 


GTACTAATGG 


TCACTGCAGG 


40301 


CTGAAGGAAT 


ATGCTTGAAA 


GATAGGCAGA 


TCCTCTCCCT 


CTCCCTTTTT 


40351 


TACTTTTTTC 


GCCTTTCCAT 


CCTTTCTTCT 


TTTTTTCCAA 


TAG ATTG TGC 


40401 


ACTTTGGAGA 


TTCATATTTT 


CTTCCTTTTC 


CATTACATTT 


TAAATATGTG 


40451 


ATTCTTAGTC 


CTATGCTTCC 


TTTTACTCCA 


ATCAATAACT 


GGCTCTATCA 


40501 


GAGGGTTGTT 


CTGTGTGTTA 


ATTCGGTTAA 


TACCAGGATT 


ATCAAGCACA 


40551 


GTGCCTTCCA 


AATGTGAGAT 


ACTTCTCTCC 


GGTTACCTCT 


GGGTTTACTT 


40601 


TTCCTGTTTT 


ACATTGTTTT 


GAGAGCCAGT 


ACTTGTATTA 


AGAAGAAGTT 


40651 


TAGTGCCTGT 


GTCACAGAAA 


AAATCTTAGT 


AAATTTTGAA 


GTGATGTCAG 


40701 


AACAACTCTA 


AGCCACTGAC 


GGATTCCACA 


GGGTTTTGAA 


AATACTCGTT 


40751 


AGTTCCCTTT 


ATATCTTAAG 


AGGCTCCTGC 


CTGCTTTCTC 


ATATACCAGT 
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40801 


AACAAACTTG 


CTTTTCTTAA 


ATATGAGCAT 


TTAGAATATC 


TTTC TC AATT 


40851 


TTTCTGTTTT 


GCTTTTATTC 


CAAATTTCAC 


AACTATATTG 


TTTTCCAATG 


40901 


TAG TTG TAC A 


TACAATCAAC 


CAAATCTTTC 


CTTAAATTGA 


TGACTACCAG 


40951 


GTGAGGACTC 


TTTGGCAATA 


AGCAATAAGA 


AAATAAATTG 


TTATTAAAAA 


41001 


TT AC AG ACTT 


AAG ATAC TTC 


TTTGGAAATA 


TAACATGTTT 


GTGACTTTTG 


41051 


ACCATCTCAT 


C ATG ATATG C 


TCATCTTAAA 


CAGAGTAGAA 


AATCATTTCA 


41101 


TATAATTAAC 


TTTATGGTGG 


GCTGCAGATA 


CCATGTATGT 


TACATTGTGT 


41151 


TTAGTTATAA 


AAATGTTTAT 


TATACACTAT 


TTCCTTATAA 


TCTAACTTTG 


41201 


ATAATAATGA 


TGGTCCTAAT 


CATGAACTTA 


CATCAATTAA 


GAGCTTGAAG 


41251 


TGACTGAGAG 


TATTTGCCTG 


GAAGCATTTA 


AAGCCCTTCT 


TGGGAAATTT 


41301 


AGATGTTTTA 


TATTTTACTT 


TCTTTTTGAT 


TTTGCTTTTT 


CCATTAAAGT 


41351 


GATTACTATT 


TTTAAAGAGA 


AAACCGAAAA 


CTCTAGAAAG 


ACCATCTTTT 


41401 


CTTCATAACA 


GGTAGCAGAA 


AACACCATGT 


TATTACATTT 


CTAGCAAGAG 


41451 


CAGTAGAGGT 


GACTTGTTGG 


TTTTGTGTAC 


TGTTGCTTTA 


GAAATTGATG 


41501 


TAAGGCTTCC 


CATAAACGTG 


CCAGAGGAAA 


AGAGGGACGC 


AATGGGATCT 


41551 


GTTATTGAAC 


ATTTCAGAGG 


CAGACTCTTA 


CCTTAAATAG 


GGACTCACTA 


41601 


TACATTCATG 


TTTTCATAAG 


TATTGGGATC 


ATGTTCTTAC 


TTTCTATCAA 


41651 


CCTGCTATTT 


TCATCTTTCA 


AGCTTAAGAG 


TAATAGGCTC 


TGTGTGTTTT 


41701 


GTTTTTCAGT 


GAGCCCAACA 


AATTTGTCTC 


AATTTAACCT 


TCCCGGGCCC 


41751 


AGCATGATGC 


GCTCAAACAG 


CATCCCAGCC 


CAAGACTCTT 


CCTTCGATCT 


41801 


CTATGATGAC 


TCCCAGCTTT 


GTGGGAGTGC 


CACTTCTCTG 


GAGGAAAGAC 


41851 


CTCGTGCCAT 


CAGTCATTCG 


GGCTCATTCA 


GAGACAGCAT 


GGAAGAAGGT 


41901 


AAGCGTTGAG 


GGGGATTAAA 


GATGAAGTCA 


CTTTATTTAA 


ACCCTGAGAG 


41951 


GGAAACCATC 


GTGTCACTCA 


CATCACAAAG 


ATTCCTGAAG 


AGGAAAATAA 


42001 


ACTAGTGTAA 


TTATCATTTG 


GGAAACTAGA 


AGCTTGAAGA 


AGTTTTATTC 


42051 


TGTATTATCT 


TCTATTTCTT 


TATGTATTTG 


GAAATATGCC 


AG AATTTG TT 


42101 


TATATTAATA 


CTTGGCTGTA 


GAAGAGTTTA 


GACTAAATCT 


ACTTTTCCAA 


42151 


TACAGAAATA 


TACATATAAA 


CTATTTTCCC 


AGGTGCATCA 


AATATCAGAG 


42201 


CAAATGTTTT 


GTTTGACATT 


TTGGTTAAAG 


AGCCATAAAG 


ACACACAAAC 


42251 


CAGAAACATT 


ATTTTATGAA 


AATACCACAT 


GTTGCTGACT 


TTTATTCCCA 


42301 


GGAATTCCCT 


CTGGTGCTAA 


TTTTTTATTA 


TATCATTTTA 


GAATTCATAT 


42351 


TGTACCTACT 


TTTTTGCTTT 


AT AAG TC ACT 


ATTTCTTCAT 


CCAATGGCAA 


42401 


TAAAATTGTC 


ACCTAACCTA 


ATAAATATCT 


TTATAGTTAT 


ATAGTTCTAT 


42451 


GTAAATACTC 


CAAATAAATC 


AGCTTGAAAA 


CCTCAGGAAG 


CTGAGTTGAT 


42501 


GCTCAAATAT 


ATATATTTTT 


GTAAACTGTA 


GAAGCTCAAA 


TGTCAAATTT 
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42551 AACAATAATT TGAGAGACTT TTCTCTTTGA TTTAATGAAT TTTTTTAGTA 
42601 TCCATAAAGA AAACTTACAG CATACATATT ATAAAGCATG TCAGCTAAGG 
42651 ATAAAATAAA AC TAG AC ATA CAAATTCAAA CTGATTAGAA TGAAATTATT 
427 01 AACC CTAAT A ATTATGTTTA AAAGAAAAGT CTCCAAATCT TGAGACATAC 
42751 CAGAGTTTAA GTCTTCAGCC ATCCATTTAC TTGTGGTATA AACTTAGGCA 

42 801 AGTTTCTTAA CCTTCTTATC CCTAAGTTCT GCATCTGTAA CTTCTTAGGT 
42851 TTGTCACAAG GATGAAATAT GAGAACAAAG AATAATTCTG TTCCATGATC 
429 01 TTTTCCCTTC CTACCTTCTT ATTTAAAGTA TCTTCTGACT GXGGGGTTAG 
42951 GCAGCAATGA AAATTGACTC ATGTTTTTCA GGTCACCACT ATGGATTCAA 
43001 TATACTGGCA TTAAATC AG T AGAGAATAGT TGTCATTGCC TTTTGCAATA 
43051 TTAACCAAAC CACTCAGTTC ACTG TGAC AG ACAGTGAATT ATATCCAATG 
43101 ACTCCACTGA TTTTTTCCAT GTAGATAGAC AAAATATAAC TACTCTCAAA 
43151 TGTAAGGACC CTGCTTTCTG AAATGGTTCT GTTGCTCTCT TCACAGATAG 

43 2 01 GCTTCTTATA ATACTTTTAA AATAATTTGC TAAGCATACA GATGGCTTTC 
43 251 TAGAGTGTGG CATTGACAAA TAAAGTGATT TTTATATACT GGGAAATTCT 
433 01 GGCCTTCAAT GTATCAGGAT TAAATAATCT GAATTTCTGA AAGCTAGCCT 
43351 AAGTGGGCAA GATGGCTTTT TTGTGCTCAC GCATTGAATA CTGAACTATT 
43 4 01 CTAGTTCTTA AATGGCGATC TAGATTCAAG ACTTATTGAA CTAGATTGAA 
43 451 GGGACTTTAT TGATATCCTA CCTAATGCTC AC AC TGAC AG ATGAAGAGAC 
43 501 TG AG CC AC AT GTTCTAAGGT CATAAACAGA AAGAATGAGA ATGAGATGGT 
43 551 CTAATTAATT GTCCACCTTT CCTATGGTAC ATCAGGGTAA CACTTTAGTT 
43 601 TACGAGGGTA TTATTAGAGA TAGAAAGAAT TTTTTTTTAA ATAATTGACT 
43 651 CAAATACCAA C ATTTTG C AC ATTACATAGA GTAATAGCTT TGCCCAAGTT 
43 701 AGAAAACTGG GGGTTCTTCT TTATTCCTCT TTTGACCACA TCTATATACT 
43751 CAGTTTTAAA AAGGTTCTTC CTGGTATCCT TCAATTCCAT CCCCATGTTT 
43 801 TCATCTACAA GCCTAGTGCA GCTATTCCAG CCGTCTCCTG ATCAGGTCTT 
43 851 - AAGCACCTCC CATATGTCCT TGTAGTACCC ACCATATTGA TCTCAGTAGC 
43901 AATCACAGTA CTCTATTGTA AATATCTTTT AAATTATTAT CTTCTCTTTG 
43 951 AGCTTTTGGG ATTTTATCTT ATTTATTTTT GTAGTTCCAG GATCTAGCAA 
44001 CAGCTTGTCA CATCGTTCAT ACTCAACTAA TGTTTGTTTA ATGCACAATG 
44051 AGCAGAAATA AACATACTAC TCCATAGTAA AAAGAGGATG AACTTTTCTG 
44101 CAAATATTAA TCAGCACCAT TTTATCCACC TTTTGGGTTT AGTACATTGG 
44151 AAGTATAGGA G TATAAAGC A GAATGTCCAA TGTTTACAGT GATATTTTGA 
44201 AATAGATAAA AGCCAGTGCG ACATTTCCAT TCTCAATTTC TCTGAGACAT 
44251 CACCTTGAAA AAAAAAAGTA TTTTTCTCTT CCTAAAATTA GTAAAGGAAC 
443 01 AGTAATTCCA CATTTATAAG AGTATGATCA ACGCATCACA GATAATGTTG 
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443 51 TAATAACACA TTAGATAAAA GTGCTTATTT TCCTGAAATT ATATGGAGAA 
44401 AAAAATCTGA AAGTGGACCT TTGTTGGATA CAAATGAAAT AAATAAGGTA 
44451 CATACATTTT TTAAGGTTCG AAAGTTTATG GCAACTTTAG TTTGGGTTTC 
44501 CATGCTATTC TATTTATTAT ATGGGAATTT ACTGTAGCTT TCAACATGTA 
44551 CGAAACAGGC TGGTAGGGCT CATGCTTGTA GGCTTCTGTC TAATAACTTG 
44601 GCAACTGAGG TACTTTAGGG AGTATGGATG GGGCTCTTCC ATGTCTCAAC 
44651 GTCCTGACTG CCAAAAAATT ATAGCAGGCT GGTTCTCAGA ATCTTATAGT 
447 01 TAGTTGTTAT TACTTAATTT CCCTAACCAC CCGTTCTTTA CTTTTTCTGT 
447 51 AAAGGCTGGA ATTTTTGAGT AGACCTTATT GTTTTAACTC TATTGTTCTG 
44801 TTTGTTTTCT CCAGTTCATG GCTCTTCATT ATCACTGGTG TCCAGCACTT 
44851 CTTCTCTTTA CTCTACAGTA AGTAATGGCT GTTAAGAAAA AGCTTGTGCT 
44901 TTTGCCATGC ACACAGATGA TGAAATAGAT CATTTTACTG TGAACAGATC 
44951 ACATTCATCT ATGACTTGCA CAGGAGTTGT GTAGCAAAAT AACGGCATAC 
45001 TCTAAGCTGC CCAATACCCA ATAAAGTGCC AGGTGCTCCA CCTGCCATTC 
45051 TTTGGTCXCT TACATGTGCT TTCACTTGGC TTTTGTGCAC TCATCATAAT 
45101 CAATGAGTGG ATGTAGAATT CGATTTCATA AAACCTACTG AGGTATGACT 
45151 TGGAGTCTCT GAAACCATGT ATGTAGTCTG CTATAC TATC ATTTTAGTAA 
452 01 TGACGAGTTG TCCATGTTTT GTTCTTTGAG CCGTGACTGT TAATTGTTCT 

452 51 ATAGTATTTT CTTCTCATTT TTTATTTTTA AGTTTATTGT TGAGAGGATT 

453 01 ATCGAAGGGT AAAAGCAGTA AGGGTAAAGG GTAAAAGCAT AAAAGAACCA 
45351 GAGATGTTTT TTTTTAAATA TACCTTTTGA AAGAGTGTGA TTTTTTTAAC 
45401 TTTTATTTTT ATTTTATTTT ATTTATTTAT TTATTTATTT TTGAGTCGAG 
45451 GTCTTGCTTT GTCACCCAGG CTGGAGTACA ATGACACAAT CATAGCTCAC 
45501 TGCAACCTTG AACTCCTGGG CTCAAGTTAT CCTTCTGCCT CAGCCTGTCA 
45551 AGCAGCTAGG ACTACAGGCA CGCACCACCA TGCCCAGCTA ATTTTTAAAT 
45 601 TGTTTTAGAG ACAAGGTCAT TGCTATATTG ACCAGACTGA TCAATACCCA 
45651 TGGCTTCAAG CAATTCCTCC TGCTTTAGCC TCCCCAAGTG CTGGGATTAC 
457 01 AGGTGTAAGC CAGCACACTT AGATAGAAAC TTTATTTATT AAGAGAAAAA 
457 51 TACCAGTGTT TCAAGTTCTT TTGCAAACGT GTGACATTAT AATTCATTTT 
45801 TGACAAGGAG AGTTTTTCTG TTTGG TAAAT ACAATTCTAT CTTTTTTAAA 
45851 AAAGTAGCCT ACAGGAAGTT ATATTTTATG AGTGAGTCTT TTTAGAGCTA 
459 01 GGTTAACAGT GAGGTATATT T AAAAG C AG C CTACTGAATC TCAATGGGAC 
45951 TTGAGTACTA TGAATAAGCC TTAATCCTGT ACTGTAAGGT TCATGAAGAG 
46001 TTCATAGCCT CTGCTGTCAC TGATCAACTG AGCATCATGG GCAGTATTTT 
46051 TTTCACTCAT TATCATTAGG TTCAAATGTT TGTTTGAACC TTCTCTTTAT 

F i9- 2 (cont'd 25) 



" 31/124 

46101 AGATTAATCT CATATATTTA CTGCCTTACA TAGTCATTCA AAATCTGACT 
46151 GTTATTGGCA GAAGTAATAT TTTTCTAATC TCTCCTTTCA ATGATTAAAA 
46201 TTACCCATAG CTTCTAGAAA TTAAGAAATC ACGATTAGTT TTTAGGTAAA 
46251 TGTACTTTTT GTGCAAATGG ATAAAG TG AG GAATGTGTAA ACACACATGA 
463 01 AAAAAACACA TAAAAGAAAT ATATTAAGAC TTAGTGTTCC TCCTGTTGGG 
46351 CCAGCACTGC CATTTGTTGG GGAATTGTAT TCTGATTTAA ACCATTGCCA 
46401 TTTACATCTA TGTGTAACAT CAAAAGATGT AGCATCATTA TTATTCTAAA 
46451 TACATACAAT AATTAATATT TGGATAAAGC TACCTTCATG AAACCTAAGA 

465 01 AAAACTAAAT TAAAAAGAAA GAAAGAAAGA AAAATACACT TAGATAGAAG 
46551 AAATAAGGTC TAGTGATTGG TAGCACAATA GAGTGACTAT AGTTAACAAT 
46601 AATTTATTGT ACATTTCAAA ATAGCTAGAA AAGAAGATTT GGAATGTTCC 

466 51 TAACAGGAAG AAATGATATT CTTCCTAAAT GAAGAATGGG ATATTCCACT 

467 01 TTCCCAGATT TGATCGTTAC ACAGCATATG TTTGTATAAT ACCACATGCA 

467 51 CCCCATAAAT ACATACAACT ATTGTGTATC CCAATATTAA AGATTTTTTT 
46801 GAAAAATTTA TTCCTCAAGA AAAGGATCAT GAGTTTAAGA AAAAACAGAT 

468 51 TACTAGTCTA CCAGTGTCCA GTAGACCTTT CTGTGTTAAT AAAAGTGTTC 

469 01 TGTATCTACA CTATCTAATA TAGTAACTAT GAACCATATG TTGCCATTGA 
46951 TTATTTGAAG TATATCTGGC AAAGAGATGA ATTGACTTTT TTATTTTAAT 

470 01 TAATTTACAT TGAAATAGCC ACATGTGCCT AGCAGCTACT AGATTGGATA -, 
47 051 GTGCAAGTTT ATAGAGAACA CAAGGGGTAC ATTTG TAG AT AGGAGTGGGA 
47101 TGTCAAAATG ATGAGGATAA TTAGAAAGCA TACATGAGAA ATATTGTTTT 
47151 AAGAGTAGAA TATGAAATGG GAACACAGAT TAAAATAGAG TATGTATATA 

472 01 TATACATATA TATGTGTATA TATATACATA. TGTATGTGTA TATATATACA 
47251 TATATATGTG TGTGTGTATA TATATATATT TATAGGCCAA TATATGGAGG 

473 01 TAGGGTATAT CCTAGTGTTA AG TGAGTAAA GAATGGATTA GGTGATCGAG 
473 51 CCACATGAGA AGGTGATATT ATTAGAAAAT TGAAAGTTGT ATTTGAGATG 
47401 - ATGAAAATGA TATATTTGAA TTGAAAAGTA AACTGTAGTA AAATAATTCA 
47451 AATAAATGAA TATTTGGGGA ACTACTTAAG AGAAAAATCA TAAAACATGA 
47501 GGAGTCATTC TTTCCCCAGT CCGCCATGAT CAGGCCTTAG GATTTAATTG 
47551 GCAATGAGAA AATACCTATG AAAATGCTTT TTAAACTATC ACATGAAAAA 
47 601 GCAATTTATT ATTTTTCATG CCTTCTTAAT AACTCTCAAT AGAGATTTAG 
47 651 TTGATTTGCA TTTTTGCCTG GTTCAATCAA GAAATTATCG CGTGACATCA 
477 01 GGCAAGTTGC CAAATTTCTT TGGACTATAC CTATAAAATA AAATTTGAAA 
47751 ATATTAGCTA GATCTAACCC ATTTGTCTCC GGATGTCTGC AAAGTGGTTG 
47 801 GAAATCACAA GCCTAACCTG ATCTGCAGAG GTGTTACCTT TGGCAAACTT 
47 851 ATGGTTTTTG TGTTTGTTTT GAAATCTAAG GCCAAGCGCG GTGGCTCATG 
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48001 


AAAATACAGA 


AA1 1 Avjv_V-v_Vj 


48051 


ATTTGGGAGG 


CTG AGG L. AVj va 


48101 


GCTG C AG TGA 


GCGCCAC 1 GL. 


48151 


GTCTCAGAAA 


AAAAAAAAAA 


48201 


G G AG AGGG AA 


TCTAAGCCAA 


48251 


CTACCTAAAA 


ATTCAAAATT 


48301 


CTATGGAATA 


TCTAGCAATA 


48351 


ACCTGTAATT 


GACACCTTTA 


48401 


GCCTCCACCA 


ATGCTTTTCT 


48451 


AGTTTTCATT 


GTGCTGTTGC 


48501 


ATTTATATTG 


CTGTCAAATG 


48551 


GTGACTATTT 


AAATACTTCA 


48601 


GTTAGGCATT 


TAAATTTTTG 


48651 


GTATATTAAA 


AACTATATTT 


48701 


ATTTTTTCAT 


G AGAGC AAG T 


48751 


CATGTTGCCT 


AACAAATAGC 


48801 


AAATTTGAAT 


GGATAAATGG 


48851 


CATTTTTATT 


CAGTCAAGTG 


48901 


GGGCTTGCTC 


AAAACCATTG 


48951 


CATCATGATA 


TCATTAATCC 


49001 


ATATTCTGGC 


TCATTGCTTT 


49051 


TAACCACAGA 


G T AC G AAAG A 


49101 


AAAAATTGGT 


AGTGAAAAAA 


49151 


ATTTAATGAT 


T ATG C TTG C T 


49201 


TAACAAAAAT 


G CGG TG AG T A 


49251 


TTATTATTTA 


AAAATACATG 


49301 


GCTTTCTATA 


ATTCTAGGAA 


49351 


GGCAAAAAAT 


GTAAAAAGTA 


49401 


GAATGCGCAG 


TGCACCTGAG 


49451 


AAAAAAAAAA 


AAAAGAAAGA 


49501 


CAAAGTACTA 


GGGAAAACTA 


49551 


TATGTTTTAA 


GGCAATGAAA 


49601 


TATTCAAAAA 


CCAAACTGTG 
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GGAGGCTGAG GCGGGTGGAT CACTTGAGGT 
GGCCAACATG GCAAAACCCC GTCTCTACTA 
GTGTAGTGGC ATACGTCTGT AATCCCAGCT 
AGAATCGCCT GAACCTGGGA GGC TGAGGCT 
ACTCCAGCCT GGGCGACAAA GCCAAACACT 
AAAAGGAAAA GAGGGAGAGG GGAGGGAGAG 
CACTGTGAAA TATTGTGAAA TATGGAGCTT 
TTAAATTCCT TTTAAAAATA ATTGGAATAT 
CTAAGATGAA ATTCCTCTGG GTTTTCAGTC 
GATGTTGGCA TGGGCTCTCA GGAAGCCACA 
TCCTGACACT GAAGCTAAAT TTGGGTGGCT 
TTTCCTCATG GGAAAGAAAT ACCCTTTGCT 
GGAAAATGAA AGACAGCCAA GGAAGATCAT 
AGTCCATTTA TTC TTTATTA GCCTTGTCCT 
ATCCCTGCAA TAGATGTTTT TTGATTAACT 
AACCTGTTTT GAATTTGAAT TCTAAATTGT 
GTCATTTTTG ATTCATTGTG GATTGTTTAA 
TAATACTAAC GTCATAACTT TTTAATTAGT 
CCACTTATTG GCTTATAGAA TAAATAAAAA 
TTTCATATTT TTTATCATCT CCAGGACATT 
TTAAAAAAAA AATGGCAAAT AATCCAGTTC 
CACACCTAAG CTACTGAAAA AAATATATTA 
ATTTTTATGG TAACACCCAC CTGGTATTAA 
AGGCAAAGGT TAAAGCAAAT AATAGTTTTG 
GTCATGCTAT ACGGTATGTA TATAATAGAT 
ACTAGTATAT GTAACAGGAC TATTATAGAT 
TATTTCTTGA TTATTTTTTA AAAGAATAAA 
AATTATTTAT TGATTCTTGA ATCTTTAC C A 
GCCTAGAAGC AGAATTGGGC AGGATAAACT 
GGCCGGGCAC GGTGGGCTAC AGTGAGTCGT 
TGATAGATCA AGATCCTGTC TCAAAAAAAA 
AAGAAAGAAA AACAACAACA AAAACAAAAG 
ATAGACATAG TTACATAGTT AATTGTGCCA 
CTTTTATCTT AATATTCCTT GCTTACTTTT 
TATAAAACCT TAAAATTATT AGGATCTAAA 
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49651 


AAATAAAATC 


TTTCCTTAAA 


AATCTAAAAT 


TGAGATGTAA 


ATTATTCAAG 


49701 


AGTGCTTTTT 


AAAACAGTTT 


TCTTATAAAG 


GCTATTAGGA 


TTCTACCACT 


49751 


TAGCCACTTT 


ATTATTTAGC 


CACTATATTA 


CTAAGTTTAC 


ATATTTTTAA 


49801 


AGGTAGTGAA 


AATATAGGGA 


AG AC AAAG CT 


CAGGTTAAAA 


GAGTTTCTGG 


49851 


CAAATAAAAT 


ATATCCTGAT 


GGTTAGACTA 


CTTTGCTTTA 


TGTTTTCTGA 


49901 


AAGAAAAGCA 


GTAAAAAACA 


GTTCAGGTAG 


TTTTGTGTCA 


ATTAATCTAG 


49951 


AACTATACCA 


AAAGTAGACA 


TAGAAAACGA 


GAGATTGTTT 


TTCAGCTTTG 


50001 


GATCTGCTTA 


TGGCAATAAG 


CAGACTTGTA 


CTATTCAACA 


AC ATTATG C A 


50051 


TTCTTCAACT 


TTTCCCAGAA 


TAAGGGAGCT 


TCCCAAATGC 


AATGGTGCAC 


50101 


ATAACTCATT 


TTCTGGCATT 


TTGCAGCCCA 


GCATGAAGAA 


GAAAAACAGA 


50151 


GCTAGGAGTT 


TTCTGGAAGT 


CAAGTCAAAA 


ACACCCTGCA 


AATTCCTATG 


50201 


GCAGTCCTCC 


TTTCCATAAG 


CTGCATAGCC 


AAAAATGTTT 


GCC AG AC AC T 


50251 


TTTATCACTG 


GGTGTTTCAG 


TGTTTTCATT 


GTTTAAGCGT 


TTTGCTG AC T 


50301 


TGTGATAATT 


AAAATTATTA 


ATAATCATTA 


AAGAAAGAAA 


AAGTAGAAGT 


50351 


AAATAATGTT 


AATTATCTGT 


GGTTATCAGT 


AGAGGTCTGT 


ATGTTACCCC 


50401 


AGCTTTATTT 


GACATTGTTT 


GTGATCAGTA 


AATCACAGAA 


TAAAATTCTG 


50451 


ACATCTAAAC 


CTTGGCTAGA 


GGTCTCTATA 


ATTTTATGGA 


GTCTGTTTCC 


50501 


TACAATCTGT 


ATGAAAGATA 


CTTCAATATT 


TTAAG TTTAC 


ATGCACCCAT 


50551 


CTTTTTTAGA 


GTATAATTTT 


ATAACTATTT 


GGTTTATGTT 


GCTTATGATT 


50601 


TACATCTTAG 


AGTCTTTTAA 


TTCTGTCTTT 


TGCTTAAAGG 


AATATTATGG 


50651 


ATCAAATGAC 


CTATATTTTA 


AGAATACCTT 


ATGG TTTAT A 


TATTAAGAAA 


50701 


CATTTATATA 


AAATTCTAAA 


GTAACTTGCT 


TGTACTATTT 


CAATTGAATA 


50751 


ACTTAATGTA 


TTTCATTCTA 


TTCTTCTCAT 


AGTAGATAAT 


AAAAAGTACA 


50801 


TCATGATTAT 


TGTATTCATT 


TATACTTGTG 


GAATTAATTG 


AAAATAGTTT 


50851 


TTATAGTTAA 


AGTCTTTCTT 


TTTATTGTTT 


TACAGGCTGA 


AG AAAAGGC T 


50901 


CATTCAGAGG 


TAAAAAAAAA 


TATGCAATAT 


TTTAATATTT 


TCTATTTTAG 


50951 


- TTTGCATTCA 


TGATGAAATT 


AGTCTTGTGA 


CCACTAGAGG 


GCTCTGTGAT 


51001 


ACAATAGCAG 


AACTCCACAG 


GACTGCTGAA 


GTAAGGCAGC 


TAATTGATAA 


51051 


ATGGTCTTTG 


ATATTGCCTC 


TTAAAAATAA 


AATGAAAGGA 


AGTTTGTATA 


51101 


GCAAGCTGTC 


C TTTC AC ATT 


C TAG ATTG AG 


TCTTAGCTCA 


ACACCTAATA 


51151 


AGTTTTCTAT 


AATAGTAAGC 


ACTCATTAAG 


TCATTGATAA 


ATGAAGGTCT 


51201 


ATGGTCTTCC 


TATTTTATTA 


CAGTCTTTTT 


CCCACTCCCT 


GTAAGACCAT 


51251 


CTACACAGGA 


TAATGGTTGA 


AACTTGGGCA 


CCAAGCCTCC 


ACAACACAGG 


51301 


ATACTAGCAT 


CTCAGACTAT 


CTGTTTTGTG 


TCATTATCTT 


GTTGCCTCTA 


51351 


ACTGCCATTT 


TATGTGTGGT 


GTGTCACCTA 


TTGTTCTAAT 


CACATATTTC 


51401 


ACAAATACAT 


ATTTGGTTGC 


ACTCGTGAGC 


AAATCAAACT 


GCATTCAGGA 
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53201 GTGGAGTCAG TGTTCATATC C TTT ATTTTC AGGAGTTGCT GCTGATACAA 
53251 TGGGGTTAGA ATGAGCTAAA TACAGCATTT GCTTTCTTGG TTTGAATTCT 
533 01 GGGTTTTAAG TAAAAATCTA CTTGCCTATT CCATTGATTT TTTTAATTGC 
53 351 ATTCAGCAAA TCCATAAACT GCGGAGAGAG CTGGTTGCAT CACAAGAAAA 
53401 AGTTGCTACC CTCACATCTC AGCTTTCAGC AAATGTAAGT CACTTCATTT 
53451 TTAAAATATA TTACAACAAA TTTTTATAGA GGAAAATGAA ATCATTTTAG 
53501 TAACAAACTT ACAAATTTTC AGTGCCTGAT ACAGACTTAG ATTACCAACT 
53 551 AGCAGGACTC ATAAAAAGTT AACATTTTTT GCCTACTCAG TAATAAAATG 
53 601 TAAATCCAAA CTGATGAGAG GCAGCAATAT GGTTAAAATG GCTTGTTGTT 
53 651 TCTAATAAGA TTGGAAACAA TAGTAACAGC CATATGGGTT ACTTCTTTTC 
53701 TTGTTTGCTA TTTTTATTAC TCCTCTTGCA TAAGATTCCC TGACAATGTA 
53751 AGAGGGGTTG TTAGTGTTTG ACTTTGGAAG ATAAAATATT CCTGTGCCCA 
53 801 GCCTCCTTCA TCTCAATGTA TTGAACAATT TGTTAAGCAT CCAGTTAATT 
53 851 CTAAAATATG AAATTAGGTC TAAATAGGGA TAGCTTAGCT GCACTGTGGA 
53901 TGAGATATGG TTTGCTCAAA AAACCTTGGC AGCCTTCTCA TAGCAATTTA 
53951 AAAGGG TAC A CTTTTACTGG CACCAGAGCA GCCCAGGATG GCAGAAATGA 
54001 TGACAATGAA GACCGTCAAT TAAATTAACA TTTACTGAAT ATCTTCCACT 
54051 GTGTCAGGGA GCACTCAGAG TAGATGCAGA ATGATAAAGG AGAAATGTGG 
54101 CACTGTTCCC AGTCCTGAGG AGCAATGGTG TTAAGAACAG CAGTGAGGGG 
54151 TAAGGAAATG CCTGCTATTT TGCCATATGT CTTACCTCTC TCACTCAACA 
54201 GTCCTTTGCT CAGTTCTGCT GCATAGCTTT GGGCCTGCTC TGTGCCTCCC 
54251 CACCCCTCCC ACTGCTCCTC TACTGAGTTT TTCTATCTCC TAGACAAAGC 
54301 ATGATATGTC AAGAGTGAGC AGGTGCAGAC CCACAGTGTA AGACTTGAAT 
543 51 AAGAGCCATT TTTAAATTTT TTTTAAGCTA TCATTGTGCA ATATAAATTC 
54401 TAAGTATGTG TATCATTTCA TTCACAATGT ATTCATTTTA GCACTGTATT 
54451 TGAATTGATT TTATTTTCTG AAATTTGGGA GAATTAATTT TGGATTTATT 
54501. CTATTTATTT TTAATAGATG GTGTTAGGAG ATTCCTGAAA ATAATAGCAG 
54551 TTTTTAGATA ATTGTTTAAG CAATATGAGA AAATAAGGGT ATTATTTAAC 
54601 CTTGTTGTGT TTTTAAAGAG ATAGTCCAGA GGCAACCGTA AATTTTATAA 
54651 TATAGGCTAC ATGTATAGAA GTATGAAATA TTGTTGTCTA GGTTCCTGAA 
547 01 TTTGTACCCA GAGGAAGTAG AATAATGTAA ATGTCAGAAC CTCCTGGGTT 

547 51 GTGTTTATCT GCAATAAGAA AGGCTCAATG GCAAACCTTA TTTATTAGAT 
54801 TGTCAGGATA CTTGCAGATG TCTTGAATGA TTACTCAGGG TTTCATTTTA 

548 51 TTTTTAATGT CCCTTGGTTG AGCTCATCAT ATAATTCAGA TATTGGAATA 

549 01 ATAAATGGCT GCTAGACATA GTGGAAGATG GGCTGATACT TTCCATTTGA 
54951 AATGTAATGA TGCTTATTGT CTTCAAAAGA AAAAACTAAA ATGGTATTTC 

Fig. 2 (cont'd 30) 
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55001 ACATTTTTTT GTTTTTGTTT TTGTTTTTTT TTCTCTGAGA ATCTCATTCT 
55051 TACTCATGAT TATTGG TTTC TTGTGTACCA TTTCAACATT TTTCTATTAT 
55101 ATGCTAATGT GTATATATAC TTAATACACA CGTGCAAAAG CTTCCACACA 
55151 CACACACACA CACACACACA CACACACACA CACACATACA CACACATACG 
55201 GAACCAAATT CTAACATAGG GGAATAATCT TCGGAGTGAA CTCTGTGCTG 
55251 CTGTTTGAAA ATGGAGATAT AATTTTAGAA AGGTTCCTGC AGTTGGCTAC 
553 01 CCACCTCGTC TGCTCTAATT ATGCTTGTCA CACTATTTTC ACTGATG TGT 
553 51 TTTCATGACT TTAGGGCATG AATTCTCAGC TGGGTGTTAA TATGACCAAC 
55401 AAAGGGTGAA AACAGGTTCT TGCATTTTTT TAAGTACTCT TTTTATGTGA 
55451 AAAGCACAGA TATGCAGATA ATACATAACT GAACATCCAG CATATCTGTG 
55501 GCTTTAAAAT ATCACGAAGA AGAGCACAAT TAGGGAAAAG AAAACATCTA 
55551 TAGTGTTTCC CTAGGGGAAC AATCATTTAA AAAAAAATAA AAATAAGGAA 
55601 CACAGACTAG AAGCAGCAGT GCCAAATAGA TAATTCATGC TAGTCTTTGT 
55 651 GTTAATTTAA AAAGTGCTAG TCTTGGAGAC AAACGCCCAA ATTGCTCTAG 
55701 GTTCCACTCA GCTGTATGTG TTATCATTAG TATTAACTTT TGCACGCTGA 
55751 TGGGAGACTG ATATATATCC TGTTTTATGT TCCTTTAAAC AATTTATAAT 
55801 GTAATTTAGA AACCTTCTCA AATCACATTA GATCCACACA AAAACCTGTA 
55851 CATAGCAGCT TTATTTTTTA ATAGCCAAAG AAAGGAAACA ACCAAAAATA 
55901 TCCCTTAATA GGCCAGTTAA TAAACAAATT CTGATACATC TATATCATGG 
55951 ACTACTACTC AGCAATATAA AGAAATGACT ATTGATACGT GCATCAACTT 
56001 GGGTGGATCC CAGGGGTATT ATGCTGAGTG AAAAAAGACA GTTATAGAAG 
56051 GTCAAATTTT GTATAATTCC ATTTATATAA CATTCCAGAA ATGGCAAAAT 
56101 TAAAGAAACA GAGAACAGAT TAGTGATTGC TAAGGGCTAA GGATGAAGGA 
56151 GAGAGAGAGG TAGTGTGACT ATAGGAAGAG GGAGATCTTT AGTTTTGTAT 
562 01 TTTGAATGAG ATGGCCATCA CATGAATCCA CATATGTCAA TCTATTAATG 
56251 TAAATCAATA TTGTATTCCT GGCTTTGATA TATAATATAA TTTTATAAGA 
56301 T ATATAATCA TTGGGGGAAA CTGGATGAAG GATACAAGGG ACCTCCCTGT 
56351 ACTATCTTTG CAACTTCTTG TGTATATAAT TATAAAATAT ATAATG TATT 
56401 AAAATGTATA AAATAATATT TTAAGTATCA GATACTGATC TTTACTCAGT 
56451 ATATGAAGTG TTCTATCATA ACGTAACATG CTTTTCCTTT ATTTGTGGTA 
56501 TTTTAGTTTC AAACTAAAAT ATAAATCACC TAAAGATCTA CGACAGTTCT 
56551 TTTGAAAAAA AATCTTGCTT TTAATTTCCC AGGAGTTTCA ACCTTAATCC 
56601 TCTCTTTAGT GTTTCTTTAT TTGGTAGTGA TAGGGACTAT CAAAGCTTCT 
56651 TACCATCAAA TACATTTACT GACTAAAAAT AGAAAAATAA TTTACATTGT 
56701 AAAAATGTAC AAATTGAATG ACAGTCAAAA GGTACAGGTA ATGAAGATAT 

Fig. 2 (cont'd 31) 
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AAAAAAGTTT ATTAAAATTC TCTTTTAGAC 
TATATAAATA GATATGTATA TAAATGACTA 
TTATATTTTA AACATTTTAA TAATATTGTT 
TTGTATATAT CTGAACATTT TTGTTCTTAA 
ATCCAGGAAT TTGAATTACT CTCAAGTCAC 
TGAACTCATT CATTTTCTTT GTGTTTGCTT 
CATGCATCAT AATCAGCCAT CAAATAATTT 
TCAGTTGTAA GAAGTGCTGT GTTTAAATTT 
ATCTGAATTA ATATCTGTTA ATGTATGTAA 
GCATGTACTT AAATTGATTA TAGGGACTTG 
GATATTTTAA ATATAATCAA GGATTTTTTA 
AAGTAAAAGT AAGTCTTTGT TTACTAGTTT 
TTCTACCTTT TGGTTAAATG TGAGTGCAGC 
ATATTAGAAC TCTAATAGCT ATAATTTATA 
TGGTTCTAAG AAATAATCTG TTGGTTTTAA 
TATTCATCTT TACTCAGTAT GTGACATGTA 
TTCCTTTATT TGGGGTGTTT TTTATATATT 
AAACTTGGCA TAATTACATT TATATGGACT 
TAATTTTTAA ATTTGAAATG CATCTATGTC 
TCCCTTTGCC CAAATGGGGT ATGGTAAGTC 
CACCTCTCAT TTGACTGGCA GAG TAAAGCC 
TTAAGCCTTC CCTCCCTTTT GTAAAGTTGT 
ACCACAGGTA AAGTGTTAAG CTGATTCTAC 
GAGCGGTTAT ATTGGCAGGT CCTATTGCCT 
CACCAACAAA CAGAAAACAG AAGCCGCACA 
AATGGTAATA GCAAACAATA AGC C AG AT AG 
CCTTAAGGCA GCTGGGTCAG GTGGGATGCT 
TTTTCTTTAC AAATCTCAGC CATTACATAA 
TAG TT ATTAC TAACATTTTT AAAGACATTA 
ACAAAAGGTC CTTTTTGCCT TTCAGCAGAT 
TCTTGGGTAT TTTGAGATTG TGTTCTACTT 
CATAAAATTT GCTAGTTACT CTTGCTTTTT 
ACAATTGGTG CATATTATTA ATGAGAAACA 
TTTGTAGTGG AAAAGAAATG TGAAACTTTA 
TTCAACTAAC TACTAATTTT AAAACATAAA 
TTATATGGGT AAATATATAT TACATATAAT 

Fig. 2 (cont'd 32) 
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AAAATATAGG AAAAAATGAG CATAGCCTTA TGCTAATTTT TCAGTTACTA 
GGTCTGATAT CATCACATTC CTTGCTTGTC ATTGAAAATT TTTTAAACTA 
TGATACTTTT TTTTAGTGGT ATTTATCCAA TTAAATCTGC TAACAAATTT 
GGTGTATAAA TCTCAAGGGT AAGGGTATGT GGAGAGTGGG TGTGTTTGTG 
TGAGAGAGAG AGAGAGAAGA GGGGGAGGAG AAAAAGAAGG AAGAGGGAAG 
GAATGGAAAA AGATAATAAA GAGTTGTTCT GATAGATTAA TCTTTAGTAG 
ATGTATTCCC TACAAATTGT TTTTCTCCAT ATTGCAGTGT CAGGTAAAGA 
AAGGCATCCC AGGATGAATT CAGAGCTAGG AACATGCACC TTTGTATCAT 
AATGCTAATG GAAGGAACAT GTACATTCTA ACTGTTACCA ATAATGGAAT 
ATATTTCCGT TATTAAGTAA TAAGCTTTAA TTCTTTGTAT TTTTGTGATC 
CATTTGATAG TAGGTGCCTC AGCATTTCCA CTCTGCTATA AGTACATGGA 
GATATATTTT ATTTAAGTCA TCTTATTCAT GTCTTTCAAA AAGAAATTCA 
TTTTTGGCCA AGGATTTCCA AATTTTGCCC CATATATAGG TATAGTTTAT 
TATAGACTTC GTTTGCAAAA TATTAAATCC TTATATCCTT TTAGGGACAC 
AATAAAATTT TATAAGTTTG AG AT AATG T A CTTGCAGTTC TACCTCAGGC 
CGTGGTGAGA GATTGAAGTG CCTCTTCATT TTAACATTTT GGGTTCAAGT 
TGTTGCATAA GGGCATGCAA ATGGAAACTG GCCTATTTTT GAGCTTTAAT 
AAAATCGTCA AATACTTCTT AATCTTAAGA GTTATAGTTA TGTACTACAA 
TATGTATAAT TCTCTAATAT TTAAAACAAA ACCTGAAAGC CACAAAAGCT 
TAC TG TG AAA TAAAATGTGA TGGAATATTA TTTCTAACTG GCTTACCTGT 
ATTTCCTTCA TTGAAGGGAA TATGAAGTAG AAAAGCCCTT TTATTGAAAA 
GAGTTTGGAA AGTAAAGATA ACTCTTTTCA ATTCAATTCT TTGTAAGTAG 
AAAAAG AG T A AAGATAATGT TTAGCTGTCA GCAGATGTCT GACACTTGAT 
GGAGCGTATC ATTACAATAG AGCAGCTAAC AATATCTGCA AAGGTCATCA 
TGAAAGTATA AAAATGAGGA ATATTTGTCC ATTGACCATT TCAGTGACCT 
CTTTTTGGGC TTTAAGTCTA AAAATCTTGG CAGATCAGAA CTTTATATTC 
GGCATTTTGA GTGTCAAATC TCTACATGAT GTGCAAGTCA GAAGGAGTTA 
TTACTTGCAA AATACCATCT TCTTTCAGAA GTTAAACTCA CATTAAATGC 
CAGGAGACTG AAACACTGAT TTTAAGAAGA CAAAGTTTAG AAAAGATGAA 
TGAAAATGTG TGTTAAAGAA GAGTCACCAG TCAGAGCTAA CTATGATAGT 
CATAGTATTT AAAGAGTTGG AACACATGAA ATTAAGCATT TTGTAAAATG 
AAGGCTTTTC ATCCATCCAC ATAAGATTCT GACATTTAAA CTATGTTTCT 
TCCATTCTGT TCACAGGCTC ACCTTGTAGC AGCTTTTGAA AAGAGCTTAG 
GGAATATGAC TGGCCGATTG CAAAGTCTAA CTATGACAGC GGAACAAAAG 
GTATGTTCAG AAATTGCCAC TGGAGACTGA AAGAAGACAG CAAATTGCAT 
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63851 


AGGATTCTTA 


AATAATACCT 


GAAGCTCCTT 


AAAAATAATA 


TTCCAGGCTG 


63901 


AGTGCAGAGG 


CTCATGCCTG 


TAATCTCACC 


ACTTTGGGAG 


ACCAAGGTGG 


63951 


GTGGATCACT 


TAAGGTCAGG 


AGTTCGAGAC 


CAGCCTGGCC 


AACGTGGTAA 


64001 


AATCCCATCT 


CTACTAAAAA 


CACACACAAA 


AAATTAGCTG 


GGCATGGTGG 
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CGGGTACCTG 


TAATCCCAGC 


TACGCAGGAG 


GCTGAGGCAG 


GAGAATCACT 
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TGAACCCAGG 


AGGCAGAGGA 


CGCAGTGAGC 


CAAGATCACA 


CCACTGCACT 
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AGACAGAACA 


AAAAAAAGAG 


TAATAATAAT 


AAAATAATAT 
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TCAATTCTAT 


ACTAAATTAA 


AACAATGATA 


ATACCTTTCT 


TTTCAGATTT 
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TAATTTAAAG 


ATTTTATCAG 


TTTACTCCAT 


ATTGGAACAC 


ACAAAGGCAA 


64301 


ACAAAATCCT 


TGCTGGGCAG 


TCTATTAATT 


TACTTCTGGA 


TGGAACTAGT 


64351 


AAAAGAATAC 


TGAATGTTAA 


GAAAGAGAAA 


CAGTCACATA 


AGAGAATATT 
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CTGGGGGCAA 


ACTGTTATGC 


AGTTGACAAG 


AATCACACTT 


TGATAAGAAC 
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TTTCACAAAT 


ACATGGTCAC 


TAAATCCAGC 


TATAGGGCAT 


GGCTGTAGGC 
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AGGAAGGATG 
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CTGTGTAATT 


TTGAAATAAC 
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AAGAACTAGT 
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AGTCATTTGT 
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T AC TT AAAAT 


GAACCTAAAG 


GATTTTATGG 


TATGAAAGAA 
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GGTATACCAA 


AAAGAAAGGA 


ACGGAGAATT 


TAGTTCACGA 


AGACAAATGT 


64751 


ATTAAAAAGG 


TCCATACTGC 


ATAG AAAGC C 


TGGTCACCTT 


TCCTGTGATG 
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AC C AG TT AG C 


TTACTTCTCT 


GCTGTTAGTC 


CAGTGGCCTT 


AACTTCCTTG 


64851 


GATAGGTATC 


AGAGATAGGT 


GAAACCTATA 


GAATTCTATG 


GAGTGTGTGT 


64901 


GTGTGTGTGT 


GTGCGTGCGT 


GTGTGTGTGT 


GTGTGTGTAT 


GAAAACTGTA 


64951 


AATGTGCATA 


AATGATCAGG 


TGTCCAGAGC 


TTTCATCTAA 


TTCTCAAAGA 


65001 


GACCCATTAT 


ATCAGAAGTT 


TTGGGTATTT 


TCAAGAATGC 


GTTCCTCTAT 


65051 


CTATCCATAG 


GAATGGCTTC 


AGTTTTGTCT 


TTAGATTCTG 


TAAGTTATGT 


65101 


GATTAGCTTT 


ACAAAAGTAG 


TATGTATTAC 


CAAATTTTGT 


CACTTTACAA 


65151 - 


AAGTTTATTT 


TTAAAACAGA 


ATGAATAGTT 


CAATGAAATC 


AAAAGAGTAA 


65201 


ATCGAATATT 


CTTATAATTG 


CCAAGTATTA 


TTAGCACATT 


GTATTCTCTC 


65251 


TCATATTCTC 


CGTATACCCT 


GCCCGTGAGA 


GAGAATATTA 


TCCATTCCTG 


65301 


GAAAATCTGT 


TCTAGCACAG 


CTAACAAACT 


CCTTTTGAAA 


CATAAATTTT 


65351 


CCTTTCTTTC 


CTCCCTCCCT 


CCCTCCTTCC 


CTCCCTTCCT 


TCCTTTTTCC 


65401 


TTTTCTTTCC 


TTCCTTCCTG 


CCTCTTTTCT 


ATCCTTCCTT 


TCTCCTCCCT 


65451 


TACACCCTTT 


CTTCCTTCTT 


TTCCCCCTCT 


GTCTCCCTCT 


CTTTCTTTTT 


65501 


TGCTGCAGCT 


TGTCACTTCA 


CTATGTAATA 


TAAGAACCCA 


GCAAATAGAA 


65551 


TTAGAAGGC T 


TTTTAGAGCA 


GCTGACGGGA 


AAGAATAAAA 


ACACTGGCCC 


65601 


CCAGTATTCT 


TGAATGAGAA 


TTCTGGCTAT 


GTCTGTTAAA 


AGCTGGGTAA 
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67401 


ATATAATATA 


TATATAAATA 


67451 


ATATTTATAA 


ATATGTAATT 


67501 


GGATTTCAGG 


TGATGATAAG 


67551 


GGATACTGAG 


AAGCTGGTTT 


67601 


AGGTGGTTAC 


ATGTTCTTGA 


67651 


TAAGACTCAG 


ATTCTAGTCT 


67701 


ACATdGCATT 


AATAGCCTGA 


67751 


GACCAAAATA 


ACTCCCTTGA 


67801 


CTATTTCCAG 


ATGCCAAATA 


67851 


AGAACACTGG 


TTTTATATAC 


67901 


ATGAATAACT 


TAG AAGG C AG 


67951 


TGTCATGTGC 


ATTCTCAAAG 


68001 


GAACCCTGGA 


GGTGAAGGTT 


68051 


CAGCCTGGGT 


GACAGAGAGA 


68101 


AAATAAAAAA 


TAAAAAATAG 


68151 


AAGCAGTATT 


CCCAATAGCT 


68201 


GTCTGTAAGT 


AGGTAAAAAT 


68251 


GTGAAATTAT 


CTCTAGGCAT 


68301 


CTATGATATT 


GGCGTAGTTT 


68351 


TTTTC TCTTC 


ACCAGCACTT 


68401 


CCTGATGCCC 


TGAGGCATCC 


68451 


ACCATCTAGA 


ATGGTACTAG 


68501 


ATTGCTACTC 


AAATCATTGT 


68551 


GAGAAAGGCT 


CAACTTCTAG 


68601 


ACATTTTTAC 


AGAACTACTT 


68651 


TATATTCACA 


GCTAAATATT 


68701 - 


TTAGTTCCCT 


GTAGCTGTCA 


68751 


ATGAGGGGTT 


TTTCTTGTTA 


68801 


CTTGTATGAG 


TGATGAAGCT 


68851 


CCTAGTAGTT 


TGCTCATTTG 



68901 TACAAAACAG GAATTGAAAC 

68951 TTTGAAAATA GATAATGCTG 

69 001 ACATTAACTC TCCCTAATTT 

69 051 TTTAAAAAAT ATATTGCTGG 

69101 GCATAGAATA ATGTGAATCT 

69151 TTTGGTCAAT GTAATTTATT 
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TATATATTAT ATAATATATA TATAAATTAC 
TATATATATA ATATATAATT AAAACATATA 
CACTACTGAA AAAAGTAAAG CTGAGAATGA 
GGAAGCTAAA ACACAAAGTA ACAAAGGCCA 
TTACATACTT TAAAAATGGA TAAACTAAAT 
TTGGGCTTCA CAGTGTGATT TTCAGCAATC 
AACTACATCA AAATTGTCAT TTGATTTATA 
ATAGAGAGGG ATTCACTCCT AACACTTTTC 
ACACGGAATC TCTTGCCAAA TTTGTGTGGC 
TTATAGCCTG GTAAGAAAGA AAAGACATGT 
AAAATTATCA TGCT ATTAG A CTCAGTACAA 
GAAACATCTG CAGAGGCAGG AGAATTGCTT 
GCACTGAGCT GAGATCATGC CACTGCACTC 
GACTGCATCT CAAAAAAATA AAAATTACAA 
TGATCAATCT GGCAGCATTT TCTGAAAGTT 
GCTAAAAGAA GACATGTTAT ATAATACTAA 
TAAGAGAATT GTTAATG TGC TTGCTGGGGA 
TACCCTATAC CTAACCTAGG ACTCAGTAGA 
GACCAAGAAT TTTATCCTGA TTTCAGATCG 
CTTCACCAGG ATTATATGAA AAAAATTAAA 
ATTATATGTG CTGAAATAAC TTCTTTTCTC 
CTATGTACCA CTCTTGTCAG AATCAAGGAA 
GCAGCTTAAT TTTCTCACAG AAGGCCAGTT 
GAATCCAGCA AACTATATTT TTTATAAGTA 
CTAAATCCTT GTGTTCAAAT TTACTAAAGC 
TCAGAATTTA AAATTTAAAA GACTTTCAAA 
TGCCAAGGCA ATTAGAACAT ATGTTAAGGT 
GAAGGTCAGA GCAGGGCAGA GAAGTAGCCC 
CAGATATTGA CTCCTATGCT AACCATAAAG 
TTACCTCTCT GAAACATTTT TTTGGGTGAC 
CTTCAAAATA AGGGAATTTG AAACCAAATC 
CAACTAAAAA TTTAGTTGAA TAAGATTTTT 
ACGTTATGAT ATTTGCCATC TAGAAGTGTT 
AG TC AG ATG A TGCATCCATT AATCTTTGGG 
AAAATTTTCA AATTATTTAC ACTACTGGTA 
TGAAACTAGA TGCAATAGGG ATGGCCAGGT 

Fig. 2 (cont'd 38) 
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692 01 TATTTCAGTA GAACAACTAG CAAGACTTCA GATGCATGGT GGAGTGGGGA 
69251 AAGGAGGACC TGTTTAAGGA AACTAGAGCT GGGAAGTGTG AGATTAACTT 

693 01 AGTGCCAATG TGAGGACCTA AAAAGCAGAT GTGGTGGAAA ATTTAAACAG 
693 51 GCTTGCCTAG AAGGTCAAGT TAGTTGATGA CACTTGATGA GATTGTCC C A 
69401 AGCTTTGGGA TTCTCAACAA AGTCTTTGTT AGTGAGAAAT TTGGAAAGAG 
69451 ATCAGGTATA GTTAAGAAAC TGGGTTGGAA AGGCCACCAG GAAAGGCGAA 
695 01 TATTCTGACA CAAAATTTGA TCATTTTATT TGGAAGCATT TCAAGCCTGA 
69551 CCTGAACGAA TTGTTTAGCC TCAGATACAT GCATAAAACT GTGAAAAGAG 
69601 ACATTGACTC AATTTAGCTT CTTTAACATG AGAAACTTTC GTGGAAAACT 
69651 AGAACTTTAC AAGCTCAGCT GGTG TTGGGG GCATCATTAT CTTGAATAGC 
697 01 TCACTGGAGG AAAATGAAAT CTTAGTTTGG TTCTCAGGTT TTAAAATATC 
69751 TATCATTTTT GAAAAGTGTG AAGTAACAAA ATATGATCTG ATTATCTTAT 
69801 TCCTAAAATC CTTTGCAGAA TTATCCCAGC CTCAATCTTC TCTTTAGTAT 
69851 TTAATGAGAA TAAGAAACTG GAAATGACTG AATTGGAAGA GTAGACTTTA 
699 01 AATCCATATC TTGATGGCAT ATACATTTTT C AG TTTTTTT TCTAAATGAT 
69951 TAATGAGGAT TCTCAAAACT TGAGTATCTT CTATGTTTCC CTTCAACATA 
70001 AAGAAATTGT ATGAAAATAT TTTAAAAATT TCTAATGATT TTATAG TTAG 
7 0051 CTATCTTGGG AATTCATTTC TAATCATGTA CCTCATCCAA ACTCCCCACT 
70101 ATGGACAAAA ATAAAATAAA AATTATTAGT TGCATCTGAA GGCCACATTA 
70151 C AATTTC TAT GCATTATAGA AACCTGAGAA AATGTATCTT AAAAAATAAA 
70201 TGTGAACAAC TAACCATAAT TATGAAGAAG AAAAATGAAA ACTAGAAATA 
70251 AACTATTGAA AAATGTCTAT GTATCAGTTA AGTTTTTATT TTAAAATTCT 
7 0301 TTATGTTTAT CTCTATAATA CTATTGGGAA AGAGAGAAAG GAAAACCTGA 
70351 CTTTGTTCTC ATCCAAAGGA GGTGATTCCA CTGATTTAGC CAAAATAAGA 
70401 CTTCCTGGTT ATAATAAATA ATAAAGTTTT TGATGTTTTT TATATGGTAC 
70451 CCCACTCACT AGGTGATCAG ACACCCTCCT GCAAAAAAAA AAAAAATACG 
70501 TATGCAATAA AGTTAAAGTT TTATGTTATT CTTTCAAGGG GAGAAACATC 
70551 TGTTTAACAC AGACCAGAAT ATTTCAACAA AGTCATCCCA ATATTTATGG 
7 0601 AGATCATAAA TCAAGCGAAA AAATATATTC ATCAACAACT AAACAAACTA 
70651 CATTAAATAG TCTCAAAGCA CATTTTCACT TTTTTTCTGA CAGGAAAACA 
70701 GGTTTCACAA GTGTGGAGAC ATTTTACCAT GGCTTTTAAC AGTGAGGAAG 
70751 GATGTTTAAA TAAAGGGAAA AATTATATGG AAAGCTCAGA GAAAAGAGAT 
70801 GGGTGTGGCT TGAGTGACAA GGTGAGAGCA GATCTCATTA AC TG AAATG A 
70851 GAGAGAAGGA AGGAATTTTG CAAATATGGA AAGATAACTA GTGCAAGTTT 
709 01 GAACAGATTA TGTCAATCAA TGTAGAATTT GGCTATCTTT TTAATCAAAG 

Fig. 2 (cont'd 39) 
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7 0951 AAGACTATGG AATATTTTAT AGGTGTTTGC TTATACTCAA AGTTTTAAAG 
71001 AAATAACAGT ATGAATTTGG TTGAACTAAT TTTTTTCATA GATAGGATTC 
71051 TCCCAAGTTA TATAGCATAT ATATTTCTTA ACTAGTTATT CTTCCTTTTA 
71101 CATATATTGT GCCACATTGA GTAACAACTA ACCTGCTAAT AGCTATTGGT 
71151 TTTTAAAAGA TAATTAATAT TAGAAAGTGA TCATTTTTCT GTTTCATATT 
712 01 AAACATGATA TTCTGAAAAA GCAACATTGC CTGAATGTTC TACATTTTAT 

712 51 C TTTTTG AAA ACAGGTTTTA TAAGAGATTT CTTGTGAAAA GCTGAACGTT 

713 01 CTGACACTGA AATAAGTCAG CTAACTCAAA GCTAAGCTTA ATTTTTTGAC 
71351 ACTGTTGGCA TGAGGTCTCA TTCCCAATTT TTTCATTTAA AGCCACAGGC 
71401 AAATGTTTTA ACAGATTTTA ATCCGTAGTA CAAGCATTAT TGATCTTAAA 
71451 TTTAAGGATA AAAACCTGAT TTTAATTAGA ATTTAATATG CATTC TAGTA 
71501 TTTACGTTGT ATAATTAATA TTTACATTCC ATGATTCCAC TATGTACCAT 
71551 TTATTTCTTT TTGAATAAAT TTCCAGTAGG AGCAGAATAA ATTTTCAGTG 
71601 AATATTTTAT TTC TTGGGGG ATATTTTTAA ATGGAAAATA TATTAAGTTT 
71651 CGGTAAAATC TGTTGCTAAT TTGGCAGTGG ACAGAATATA AAAATTGGAG 
71701 AGACTGAGTC ATTATGATGA ATTGGGTCTG ACTTTTGTCA TGACACTGGA 
71751 AATTTCCCAC AAATATTATA TTCTTCTTTT ATAATAAATA TAGTCGAAAT 
71801 GAATTGCAGT CAAGTATTTG AAGACCCATC TATAAATTTA GGCGGTTACT 
71851 GTTGATTTTT CATTATGAGA GATTCTTCCA CTCATAAGCT ACTAAAAGTA 
719 01 CATAAAGAAG GTCTGGTTGT TTGTTTTAAA TGTGACTGTT CTCTATCAGG 
71951 AAAATGTCAG GTATCCGATG AAAATAGATA TATGAGGTGC CAGGTATCTA 
72 001 TTCCAAACTT GGATATCACT TCAATTAGCA TCATCTTTTT TTTTTTTTAA 
72051 AGTGTCTAAG GTTAGAATAG TCACCAGATA TTCCCATGTA TGAAGCAATT 
72101 TTCTGCAAAG GCCGCTGTGG ATGATCTTTT TAAAATATAT ATTCTGGGAG 
72151 AC ATTG AG T A AAGAGAAATT ATTTACCAGA GAATGAAGAA CCGAGGCCCG 

722 01 ATTCTTTGGC TTTCTGCCAA AGATGCTGAA GGCAGTGATG AATGACAAAT 
72251 - ACATTACCAA GGAATTCTCC CTCTAAGAGG CTGACAAAGA TCTGATTTTT 

723 01 AGGATTATAT TACCACCAAG AAGATACCCC TTGTCACTGA GCTTCTAATG 
723 51 GAAATATGGT CTATACTGAA ACAATTCTCA GTTCTTTTTC TTTCTATCTT 
72401 TTTTTGAGTT ATTTTATCTT C C AAAAATG A GTTATTTCTG ATAAAATAAT 
72451 TCACTTAAAT AATTATGAAA GTTCAAATTT GTGCAAATAT TTTTATTGGG 
72 501 ACATCTTAAA ATTACTCTAA ATTCAAAAAG AAAATATATG CTTTATTAAA 
72551 ATTTGATCTG TAAGCTGCTT TGTTTGTAAT TTAACTATTA TATAAAAATT 
72 601 G TATAATAC A TATATTTTAT TTACTTTATT CCTGTGTTGC TTTGGCTTGG 
72651 TG&GACTAGG TCTCCACATT AGGAGTTTTA CTGAATGAAA AAGTATCAGA 
727 01 ATGTAACATG ACTTTGATAT GGCATCAGAA TTTAATAAGA TGACATTTAA 

Fig. 2 (cont'd 40) 
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72751 TAGGAATTAG GGGTAAGTTC CAGGTTTTAC ACTTAAATAC AAATAATCAA 
72 801 TTTTGCAGGC ACAAAATACT TCAAACAAAA TCTGAAATCA TTCATTTGAC 

72 851 AAAACTTCAG GTTTGCAGTT GACAATAAAT ACAATACAAT GCAACAGTGC 
729 01 AATAGTGATA TCTAAATATC TAATGTAATC ATAGGTAATA TTAGTAAGTG 
72951 TGTTATCTGA AATGAGTGGT GTGATATCCT GCTTTACTTT GTACTGGTGA 

73 001 GTTCTGGGTG CCACCTTTGA AAGGAATAAA GACTATTCAT ATCTCTTTTA 
73051 TAAGACAATA AGAAAAACAA ACAAACAAAC AAACAAAAAA CCACCTCCTT 
73101 TACTTTAGCT GAGAAAGAAG TTATTAGGTA CAGCTTGACA AGTTCAGCTA 
73151 AGCATCCAAA TCTTCCAGGA GGTTGTTACT ACATAAAATC AAACCTTTTT 
73201 AATTCAACTA TGAGCAGGGA GATTTTATTT TTCTTTCGGG TACTAAAGCT 
73251 TCCAAACTCT GTTTATTCCA CAGGAATCTG AACTTATAGA ACTAAGAGAA 
733 01 ACCATTGAAA TGCTGAAGGC TCAGAATTCT GCTGCCCAGG CGGCTATTCA 
73 3 51 GGGAGCACTG AATGGTCCAG ACCATCCTCC CAAAGGTATA TTTAGAAATC 
73 401 ATTTCATTTC CACCCAATAT AATAGGCATC TATTTTATTT ATTAATTACA 
73 451 G TAG AAC *FGC ATTTACTCAG TGTCACTGTG CATTATTAAT ACATACTAGT 
73 501 TGTATTAATA GTTGTATTAA TACATACTAG TAGTATTAAT ACATACTACG 
73 551 TTGGTATTAA TGTGATCAGA ATCCTAGAAT TTTAGAACAG TGACTTCCAT 
73 601 TATCAGATAA TTTTTAAACT GATCTTAAGA AATTTGGTTC TATAGTTGTA 
73 651 TACACATCTC TCTACTTGAT TCAGTGGAGA TGGAGATGGA GTGGTTGGTT 
737 01 AATACATGCA TATCTGACTT CAGGCAAAAC AAACCCATTA ATGAGTATGA 
73751 TAATCTAGAT CTGTATTTAA AAATGAAATA GTCAATATGA TGATATAGTA 

73 801 AGCAGTGGGC ATTGGGAACA ACTTTTCCTG GATGGAGGCT ATAAAAAGGT 
7 3 851 ACATTTCCTG TAGATAATTT TGAAACAATA AAAACAACGG GTGAAAGGTA 
7 3 901 GCTCTGTTTT AAATTATTCC TATGCTTAAG CAATTCTAAA CAATGAAAGG 
73951 GGTATTTCTG CCACTGCCCC TACCCCTGGG TTCACCACTG AAGAAATGCT 
74001 CATTATTAAT ATCGTGTCAT TTTTTTCCTT TACATTGGTT CTATTTACTC 

74 051 ATTTCCTGAC ACTTTTCAAT GGCCTTCAGT GAGCTCAGCT CTTTCCCAGC 
74101 TTAAAAAATC CTGTCCTAAA ACATGAATGC CTTATTATCT CTCTTTTCAT 
74151 TTCCAGAAGA ATTCTGAGAA AAATTTTATG AAGTCTTTCA ATGTCTTCAG 
74201 CCATCTTTAG ACCACTGGAG TGTAGCTCCT TTTCCCTCCA CTCCACCAAA 
74251 ACAATGCTCT CCAGGATCAG CAGAAACTTA CATGACACTA AATTCAGTAA 
743 01 AACGTTTATA ATTCTTATTG TATTAGACAG ACATGGAAAC AGCATTTGAT 
74351 GCTGATATTC ATTTCTTCCT ATGTGAAACA TCCGGTTTTT CTAATGTTCG 
74401 TGACATCATA CATTCTTGGT TTTTCTTCTG TTCCTTTGAA ATATTTTTTC 
74451 AATATTTCTT TTGTAAATTC ACTCTTTTGT ATCCATTTGT TAATTG TTG A 

Fig. 2 (cont'd 41) 
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74501 


TATCCTAAGC 


TCTCTTCCAT 


TATGATTCTA 


TGCATCCTAT 


TTAAAATATA 


74551 


TAGAAAATCA 


TCTCATACTC 


TAGCTGTAAT 


TTTTATTAAT 


GTGCTAATAG 


74601 


CTAATAACTG 


TCAAATCTAG 


GTCTCCAGGC 


CAGGCTCTGT 


ATATCCAGCT 


74651 


ACCAAGAGAG 


AACTCCACGT 


GGATATCTTT 


GGATGTCTGT 


TTTGCATCTT 


74701 


AAACCTAACT 


TCTCCAAATT 


TGCACTTGTC 


TTCTGTCTCA 


GACCTGCTGC 


74751 


TCCTTCAGTG 


CTCTTTGCCT 


CAGTAGATAG 


CACCACCATC 


CTTCCATTTA 


74801 


GCCAGAAATC 


TAAGTATTCT 


TCATAACTCC 


TCCTCTCCTC 


ATTGAATAAA 


74851 


TTACCAAGAT 


CCGTTGATCC 


CATTCCTTAA 


ATATCTCTTG 


GATCTGTTAA 


74901 


CTTTTCTCTG 


ATTTTACTCT 


TGCCATCCAT 


CACCTCTCTC 


CTGAACCATG 


74951 


ACCACAAACC 


CCTAAATAGC 


CTTCCTCTTC 


TTAATCTTAT 


CCTGCTTTAC 


75001 


ACCAGTCTTC 


ACGCTGAAGC 


CAGAATAGTC 


ATTAAGAAAC 


AC ATC TAC AG 


75051 


GTATCCCATT 


CATTGCCTTT 


AGAATGGAAT 


ACAGACTCCT 


CAGCATGACA 


75101 


TAATCTCTCT 


TCACCAGCTT 


CATTTATTCA 


ACAAATATTT 


ATTCATAACC 


75151 


AATTAAGTGC 


CAGATGATGC 


ACATATAGAC 


TTCTTGTTC T 


GTTGTTGCAT 


75201 


TGCATATTCC 


ATATTTCAGC 


TATCCTGAAT 


TGTTTTCAAT 


TATTCATAAG 


75251 


TTCTTTATGA 


ATTGTGTTCA 


TTCCATTTGG 


AATATTCTAC 


CTTGTTTGAT 


75301 


CAGCATAAAG 


ACTTTTCGAG 


ACACTGCAGC 


AGCAGTGAAC 


CTAAATATGT 


75351 


TTCCTTGACC 


CCTACATTGA 


ATGACACCCC 


CTGTGATATG 


TTTCTGGAAG 


75401 


CAGCAATACT 


TCCCTTCTTA 


AAATTACATT 


ATACTTTGGG 


GCTTTTATTT 


75451 


AAGGTATGTC 


TTTCCTGATT 


TACAATAGTA 


GAGCTTGTTT 


TTTCACCCTT 


75501 


TTGAAAGACA 


TCAAGATGCC 


CATGATGATG 


TCTTGCATGT 


AACAGGGGTT 


75551 


TATTTGAATT 


TTTAAAAGAA 


GAATAAAG TA 


ATTTTTAAAT 


GAATTTCAAT 


75601 


TTAAATTTTA 


GGAAAACAAT 


TATATAAAGT 


GAGATATGCT 


TAAATTGAAG 


75651 


GACAAAGTAG 


TTCTGTAGGG 


GCTACTTCTT 


TCAAGACTTT 


AGCAACTTTC 


75701 


CATGTGGGGG 


AGTGATTTAT 


GTGATGCATG 


GAAAA.TTACT 


GCATATTTAA 


75751 


AGCTTATCTT 


AGAGCTATAA 


TAAAGCAGCT 


TATGTTCTAA 


ATCTTCATGT 


75801 - 


CGTAAATAGG 


TCCAGAAGGG 


ATTTAAAAAG 


CCTTAATCCT 


TACTTTAACA 


75851 


CAGCACAAGT 


CACTGAAGTG 


AAACTTGCTG 


AAAGGATTCC 


TTTTATGTTA 


75901 


GGCAACAGGT 


AGCTGAATAT 


ATCTACAGAA 


ATTGAAAAAT 


TGGAATTCTT 


75951 


TTGCTCAGAA 


ATGTGGGAGG 


GGTGGAGCTT 


AAGGTAAAAA 


ATAAC AG TT A 


76001 


ATATCTAAAT 


TGATCAAGAA 


ATATGAAAAA 


ATAATTTGC T 


AGGTTTTAAA 


76051 


ACTAACAAAA 


ACCATGGTTA 


TAAAGGTTTG 


AATATATATA 


GGATAGTTAG 


76101 


ATTG T ATTTC 


TGTAATATTA 


AAACTCAGCA 


TTAAATTTAA 


TGAACACAAA 


76151 


GTGATTCTTA 


TCACATTGAC 


CATTGACATT 


ACATGGAAAA 


AATAGTCAGT 


76201 


TGGACTAATT 


ATGTGTCTTT 


CCATGGGTTA 


TTAAGG TAAT 


TGTATGGCAT 


76251 


ATAAATTTAT 


ACTGGAAATC 


ACATTGAAAT 


TCACTTTTAG 


AGGCCCTTAA 
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76301 


AATATTTCTG 


TAATATATAT 


TTTTAACATA 


TGATCTTAAA 


AGATATATTT 


76351 


GGAATGACAC 


AACAGTTTTA 


TAGACAGGCC 


TGACTATCAC 


ACAACCACAC 


76401 


ACCAATTTGT 


GAATGTGTTT 


CTATTTCCTC 


TAAATTAATG 


CATCACATTC 


76451 


ATTAACAAAG 


TTTGATAAAT 


GACTATAGTC 


TATAATAAAA 


TATTTTTGTT 


76501 


TACAAACATA 


TTTAAACACC 


TGCTATTAAG 


TATAGGCATT 


ATCAGATCTT 


76551 


AAAATACAAA 


GATTTAAAAA 


ATTACCCTGT 


GGTCATGGAG 


CTCACAATCC 


76601 


ACTGCAAAAA 


TAATGTTTGT 


GATAAGAAAT 


TTGAAAGTTG 


AAGGTAATAG 


76651 


AAAATTTTAC 


CTTTATTTTT 


CAAAATGTAC 


CATTGCTTTC 


TAAGTCACTA 


76701 


CTTCTGTGTA 


AATATGGAAT 


TGTTTTTCCT 


TAAGATATAC 


CAAATATAGT 


76751 


TGGATAACGC 


ATGTATTAAA 


ATTCTGTCAG 


CACTAAGTTG 


TTTTTTAGAC 


76801 


ATAGTGATAG 


GCAAACATAG 


TTATATTGAA 


TGAAAAATTA 


GAATCAAATT 


76851 


TATTAAACAC 


TGTGTACTGA 


TTGATACCAC 


ATGCCATATG 


CTTGTATAGC 


76901 


AATACAAGGT 


TTGGAATTTA 


TAATGGTAAA 


CAAAATAGAT 


ACGGTCTTTG 


76951 


TCTCCATAGA 


ACTTTTAGTC 


TAGTGGGAGA 


GCAGAAGGTA 


AAGGAATGTA 


77001 


TGTGATCATT 


GGTGAAGCTG 


AACATGTATA 


CCCAAACAGT 


TATAAGTTCC 


77051 


AAGATGGACA 


ATAATGGGTG 


CCATAGGGAA 


GGAGGGTACC 


AAGGAACCTA 


77101 


CTGGAGGTTA 


CATAGGGAAG 


ATTATTCCAA 


GGTAGTAATA 


TTTAAGTGAA 


77151 


TATCCAAGGA 


ATAATTGTCA 


ATCACTTTAT 


AAGTACTGAG 


GGAGGAGTAT 


77201 


TTCAAAAGAG 


CTTTGAGGCG 


GAAAATAAAT 


TAGTTCCTTT 


ATGGAACTAA 


77251 


TGTAAGGAAA 


ATACTAAGCA 


AACATGTAAT 


AAGAAGAACA 


CGGTTGATGA 


77301 


GTTAAGAACT 


GACAAGATTA 


CTGAAGGATT 


GTAGGCCATA 


TTTAGAAGTT 


77351 


GGATTTTTTA 


TCTATTCTTA 


TTAAAGTGAG 


AAGTTATTGA 


AAGGTCTTAA 


77401 


GTGGGGGAGT 


GATGATGAAG 


TTTGCCTTTT 


AAAAAAGATT 


TTTCTAGCTA 


77451 


TTGTTTATAG 


AATGGTTTGA 


AGATGAATAA 


GTCCAATAGC 


TATACTTGCT 


77501 


GTAAAGGTTA 


TGTTGGTAGC 


TTGAACTGGG 


GCAGTGGTGA 


CACAGAGGAT 


77551 


GGGAGATGGA 


AAATGACGAG 


TGAACAAACA 


CATACCTGAA 


AATTTAAGTT 


77601 


TAAAAATAGA 


CCTCTCCATT 


AATTCAGATT 


GCTGATATTC 


ATTCGGTTAG 


77651 


CCATTCTTTA 


CTGAACTTTA 


TGATGCCCCA 


TATACTGAAT 


TAAATACTTA 


77701 


CAAGCACTAA 


AAAAGAAATT 


GTTAGGGAAC 


AGTAAAATGC 


ATTTCCTTCA 


77751 


TTTCACAATA 


TTATTAATAT 


TATGGCTTTG 


CTAATCTTTA 


TTGGTGAATG 


77801 


CAGTCATAAT 


TGAAGGTAAC 


TGATACTTCC 


AAGGACTACT 


TTTGACCTAG 


77851 


GATTACTATC 


TTTTTAAAAA 


TTTAGTATTA 


AAGAAGTCAA 


ACACAATTTA 


77901 


TTAATTCTGG 


ATATAATAAA 


AATTCTGAAA 


TACTTTAATA 


CTTTGTGCTT 


77951 


TTCTATTTGT 


GAAAGTTAAT 


TATTAGGAAC 


GAGCTAGCAA 


ATGCTACTTC 


78001 


TTTTTCAAAA 


AGCTAATGGC 


CAATCACAGC 


AAAAATTTAA 


AGCACTAAGA 
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7 8051 AATACCTACA CATATTCTTC TATTGCCCAT TTATATGACT TCCATAATAG 
7 8101 TTGATTAAAG GATACCGGAT TCCTTTATTG TTGAATTAAA ACCTCCTACA 
7 8151 TGAAAACCTT GATTTAGGTT TAGAAGTTGG TAATGTTTTG GCATGCAAAA 
7 8201 CCAGTTAATG TTCTCATCAT TACTTTTTAA AACAATGTTA AGAGATGAAT 
78251 TCTAGGGATT ATAAAAAAAA AAAAGCTGTA TGTGTTTCTT CCTATAAAAT 
7 83.01 TTTTCAGCAT GATTGCCTCA GTAGAAAAAT TAAGGGACTT ATTGATATAT 
7 8351 ATGTATATGA AGGTGAGGAT ACACATATAC ACACACACAT ATATATGTAG 
7 8401 GTAAATACAT ATATTACATG TCTATCAATC CATACATACT CATTTATTAT 
7 8451 ACGTTTTGAA AGCAACCAGT TAT AG TTTTG TTGCCATGGA TCATTTTTAC 
7 8501 TATTCAGTAA ATCAGTCAAT TGAAGAGGCT TGATTTTATG GTATTAGTTT 
78551 TTTGGAAACT GTCAGCTTTA TAGTAAATTT TGACATCTTA CAACTTCCAC 
78 601 TGAGATTTTT TTGCTTGACT AATCTGCCTT GATGCCAATA AGTATATTAA 
7 8 651 CGGAAATGGA CTAAAAGCAA ATGTGACTTG AAGCACAATT TTGTAAATTT 
7 8701 TCTTAGTGTC TCAGTAATAC TTAATAC TAG TGCATTTTAG GTAGGAAAAT 
78751 TTTCAGTTTG TTTTATTTTA AATAACTATA AATCTTATAG TTGCTTGTAT 
7 8801 AAAAGAAACA GATACCTTTA ACATGATTAA ATATCAAATG CTATTCTCTT 
7 8851 CAAAATATCT TAACTAAAGA AGCACTGCCT GCTCTTAGAA GTTAAGCAAG 
7 8 901 GCCATACCAT ATGCTGCGTA CATGGCTTTT AACACAATGG ATATTAGAAA 
78951 CAGCCTAAGG CTGAGCCTGG CTCCACTATT TTTCAGCTAT GTGACCATGT 
79001 GAAAGTTACA TTTAGTAATT AAACTCATTT C AG TAG TTTG CTTTAAGAAT 
79051 AAAATTAGGT ACTCCGGGGG CATATCAAGC ATATTG T AAA ACCTAGTTTG 
79101 ATTATTATTT GTTATTGGTA TTACTATTAC TATTCTATAA TAAGTCATGG 
79151 GCAGGCAGTA GGGGTACATT GGAAGAATTG CACTGTCTTA AATATGTCCT 
79201 CTGTTTAACT CACAAACTCA GTCTACCTAG GCTTTCTTTG GAGGATCTGC 

792 51 CTTTCATTGG CTGTTTGACT TTGGCCAAGT TACTTAACTT CTTTTCACTT 
7 9301 CAGTTTCCTC ATCTGTGAGA TTATGTGCTT ACATGACTTC AGGTTTTGTT 

793 51- TTGGCTCTAA TATGGTATGA TTCTATGAAA TGGAAAGTTA ATACATTTGG 
79401 C TC TAG TAAC TGTATTTGAA GCACAAATAT TAAAAAGCAC AATTAATTCT 
7 9451 CATTCTGAGT TTCCATTTAC TCTTTTAAAT TAATCATTCA GAATAAATCA 
7 9501 TTTTGGAAGA GCTGCTTGAT CCAGGTATTC AGTAGAAATC ACTAGCATAG 
79551 CATTTAATTT TAGACAAAAC TGAGAACTCA TTAAACTGCC AGGGCTATGG 
796 01 ACTTATATGA GATTCTCATT AAATCTTAAT GTAGATAACT CAGTTAATTA 
79 651 AAACAAATAT GGTTGTACTT TATTAAACTT CTAAAGTCAA AACTGCATTG 
79701 AAATTATCTG TACAAAGCCT TGTTGACCTT TATTAGAGAA CTGCCTCTCA 
7 9751 AAAGACCTAA AAGACTTATT TGTTCAGATC GAGACTCTTC ATGAGCCAAT 
79801 GTGATACTCT CCCTCTATTG CTAGATCTTC GCATCAGAAG ACAGCATTCC 
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79851 


TCTGAAAGTG 


TTTCTAGTAT 


CAACAGTGCC 


ACAAGCCATT 


CCAGTATTGG 


79901 


CAGTGGTAAT 


GATGCCGACT 


CCAAGAAGAA 


GAAAAAGAAA 


AACTGGGTAA 


79951 


GTTACCATCC 


TTCATCTAAT 


TCAGAAGCTT 


ATTAATGCAT 


AATGTGTTAG 


80001 


GCCTTTTTCT 


TTGGGGCTTT 


AGTGATCTGC 


AGTAGTTTAC 


AAAGGGTCCC 


80051 


ATTCAAGCTA 


CTGAGACCTC 


AAATGCTGCA 


CTCATCACCA 


AAATTGGAGT 


80101 


GGCATGTACT 


GAAAAGCATA 


CATTTTAATG 


TTGGGACTAA 


ACTTGGGTTT 


80151 


GAATCACCAC 


TATATCTAGA 


CCTTTTGAGG 


GGCCTGAATT 


TTCTAACCAA 


80201 


TAAAAAGACA 


GTTAATAGCA 


ACTATATTTA 


TTTGTGAATA 


TCATTTATTC 


80251 


ACAGATGTTA 


TCTAATTTTT 


CTATAGTATA 


ACTATACAAA 


CTATGTAGTA 


80301 


TAACTATAGA 


GTTATACTAA AGAAAAATAA GATAACATCT 


GTGAATAAAT 


80351 


GGCTTAAAAT 


AGGGGTTTAT 


TGTGGGCATA 


GAGATGAAGG 


AAAAGTGAAA 


80401 


AAATGATGAT 


GATGGTGATG 


ATGATGGTGA 


TAGTGGTCTT 


GGAGGAAAAG 


80451 


GAGAATGGGA 


GTTAATAAAG 


GGAAAGAATA 


AACAATGAAA 


CTCTCATTCC 


80501 


ACCTTTGGAA 


TCGACAGGGC 


TTACCGTGTG 


AATAGTTTCA 


CCCTAAAAGA 


80551 


AATCAACGAC 


ATTAGTGTCT 


GCTTGATGTT 


TTTAACCAAG 


AGAATATAGC 


80601 


AGAAATATAG 


AAATGCACTT 


TAACAGAACT 


GTACCTTAAG 


TTTGCTAGTG 


80651 


ATATAATTTA 


TGATATTGAT 


CAATAGCTAA 


ATAGCCCAGG 


GGAAGATACT 


80701 


GTTACTGCGA 


AAAATTTAAA 


AACAATGGAG 


TCAATGATTT 


CTTTTAATAC 


80751 


CAAAAAAAAA 


ATGTAGATTT 


TG AG TAAATA 


CAACTCTTGA 


TGAAATCCAG 


80801 


ACATAATTAT 


CAGAGGATTT 


TACTGGAGTG 


CTTTCTACAA 


ATAATGAAAG 


80851 


AAATATCTTT 


TTATCTTAAA 


AAATGTTTAT 


ACAGGTAATA 


TTTTAAAATA 


80901 


CTGATCAGCC 


TTCATTCCCT 


TGATTTGTAA 


TTCCACACTC 


TTTCATGTTT 


80951 


CTGCAAGGTG 


AACTCTAGAG 


GAAGTGAGGT 


GAAXATAAAC CGTGGACAAT 


81001 


TTGGCATGGA 


TXTATAAAAA 


AACCCTACCT 


TGGCATGAAT 


GCTATCCATT 


81051 


TTGGCAGTAG 


GCTTTTATAC 


CTTTTAAAAC 


AGATTACCTT 


GTATGTCTTT 


81101 


TCTTTGTGTC 


TTTTCATTTT 


AATCTCAAAT 


TTTAAAGAGA 


TGTAAAACCA 


81151 


CTTTCTGAAT 


AGAGCTGTAG 


GGGATACCAA 


TTCTGGTTTT 


GAGTAGTCTG 


81201 


GGGTTGGAAA 


ATTTGAATAG 


AAAAATCACA 


ATTAATGAAG 


TGTTAGGTGA 


81251 


ATTTGATTTC 


ATTTTGCTTT 


TTAAGTTTGT 


ACTGTCAGCA 


GGACATGACT 


81301 


TGATTGTAGC 


GCTAAAGTGG 


CCATTTAAAA 


CAAATTGCCT 


TGAAGAGAGA 


81351 


AGCATTGGGA 


ATGGAGATC 
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1 
51 
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151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
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851 
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951 
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1101 
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1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 



• Human genomic sequence 
GAATTCCTGG TGGAGAACAG CACATGTACA GATGGGGTGA GAACAGCATA 
CGTACAGGTA GGGGTAAGCT GGTG CTATAT GAGAAAGCAT GGAATAAGTT 
ATTAAGTTTG ACCTGCTTGG GAACTGAGGG GCAGGTGTGA GGGATGAAGC 
AGGAGTAGGT AGGGGCTAGA TCACAAAAGA TCTATGCCAG TGTTTCTCAC 
AGTGTGATTC CCAGCCCAGT AGCATGATAT CACTTGGGAT CTTGTTAGAA 
ATACAAATTC TTATACATCA CCCTGGACTA GACCACCTGA ATAAGAAAAG 
TTGGGCATGA GGCCTACAAA TTTTTAAAAA AGTCATACAG GTGATTGCAA 
TGCATGCTAA AGTTTGAGAA ACACTCTTTG CTGTGGTTTG AATATTTGTG 
TCCTTCCAAA ATTC ATG TAG AAACCATCTG CAATGTTATA GTATTAAGAG 
GAGGGACCCT TGGGAGCTGA TCAGATCATG AAGTCTCCTT TCTTATAAAG 
GGGATTAAAA GCCTTGGCCC TTTTACCCTT TGTCCATGTA AGGACACAGT 
GTTGGAAGCA GGGACTGGGT TCTCACCAGA AACAGAACCT GCCAGCCTCT 
TGGTCTTGGA CTTCTCAGCC TCCACAATTG TGAGAAATAA GTTTCTGTTG 
TTTATAAGTT AACCAGTCTC AGGTATTTTG TAATGGCAGC ACAAAGGGGC 
TAAGAAACTG TTCTATGCCC TAACAAGAAA TGTGGTCACT TTCCTGAAGG 
AAATGGGGAT ATATATAAAG ATGTTATATA AGACTCGTAA TATTTATTTG 
GAAGGCTTGC TCTGCAAGCA AGGTGGAAGA GCAACATGAA GGAAGCGTGG 
TGGAGGTGAG AGGACTGGAG GTTAAGTTGG TAGGGAGATA CAGGAAAGAA 
GC TTATG AC A CTTGAGTTAA AATGTAGCAT CCTTCCTATG TGTAGGGCTC 
ATAAAAATGT ATAG TCTAAG ATAGAACACA GAATACTCTA TGAATCCTGC 
CCACAAGGTG TTGGTAATCT AGATTCACTT TTTTTTTCTG ATAATGCCAT 
CCATATGTAT GGAGCGTCTA CTACTGTATG CCAGAGTGAC TCTGGAATCG 
GTTTGGTTGA TCTAGACAAG ACCATAAGGA GAGTCCCCTT ACTACCTCTT 
CTCCAGGGGA GGGATTCAAG TTGAACTAGT ACTTCAGAGA CTGTTTAGTA 
ATATCATGCA TGAAAGGTGA TGGTTAGGAC AGAAAAATAA ATGGATTGCA 
TCATAATTCC TCAGGTTCTC CAAATATGTG GTGGTCTCAA ACCATGTGAA 
TTGGTCTGCA CATCCTGTTT GGGTTGCGTG TCAGCAGTTG AGATCTGAGC 
CTTATTTGTA ACAGTGAAAC AGTGAGAGAC CTGCCCTTCA AGAGCTGTTT 
TTCAGCTAGG AATAGAAAAG GGCCAGGCTA GACTCCTCTT TC TGCTGG AT 
CTTGCTTCTT CTCAGCAATA GAAGTAGACC TGCCTTCCTA GCTGTAGAGA 
AAAGGTGCCG GTAGGCGGGC AGGTGAGCCT GTGGATAATC CTGGAGTAAA 
GGTTCAATAG ACCTTCAAGT CTATCCTACA GGATTCGGAG TGAGGGGAGA 
GAAAAGGAGA CGCTTCTCTG GCTGAGAGAG GAAGAGAAAA AAAAATCCCA 
GATATCTGAC AGCTATATCT TCCCATCACC ACCTTCCTCT AAACCCATGC 
CTCTCTGTTT AGTAGGACAT AAAATGAAGA GTGACCCACC CCCCACCCCC 



Fig. 3 
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1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251. 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2651 
2701 
2751 
2801 
2851 
2901 
2951 
3001 
3051 
3101 
3151 
3201 
3251 
3301 
3351 
3401 
3451 



AGCCCATCCC CCGTTTGTAG GTGTGCTTTC AATGAAAATA AGTCGGTGTT 
CATGGACGGA AACTAGAGCA GCTGAAAATA GATGCAAGAC TTGTTGAGCA 
TACAAATCAT TTCCCCCTTA GTCTCCAAGG GAGGAAAAAA AATCCCTCTT 
ACTCTCCTTG CAGCCTGTGT TCTGCATTCT GGAGAGGAAG CTGAGGCTGG 
TCCTCAGGCG CTCCTCCCGC CGTTCCCGCA GGAAACTTTT CTCGCAGGGC 
CCGCTCCGTC CATCCCGCGC GGTTCCAAGA CGGTGGGCCT CCCGTGGGCT 
CCTCTCCTGG GCAAGGGCCC AGACCCCGCG ACGCGCCTGT CTCTTTAAAT 
TCCAGCTGCG CGGCTGGGAA ACAGCGCCAC TCGCCGCCCA GGCCGGCTGG 
AGGCTGAAGA GCGAGCTCGC GCTTTCGCTC CCGGCTGCGC GCCGCGGAGA 
GCTGGGCTCG GCCCGCGGGC TGCTAGGTGG CGGCGGCGCG GGGCGGGGAG 
GCGCGGCCCG GCGGAGGAGG GAAGAAAGAG CGAGCCGGGC CGGGAGAGGC 
GCCGCGCCCG GTCCCGCGCC CGGTCCCGCA CCCGCTCTCA GCGGCCCAAG 
C AG TTTCTTT CTGGGTGACA AGAATGTGCC TCGGTTGGTT TTTCTTTTTT 
TTC TCCATCT CCTTAAGACG ATTTCCATAG TAACCTGATC AAGTGGCTCA 
AAATCGCAAA CCTGAGGATT TCCGCGGCCC GCCGGCAAGA CCTCGGCCAG 
GTAACGCTGC GATCTCCTCC TCTTCCATTG CAAACCGCTG CGCTCCTTGC 
AAAGTTCCTT TTGTGGAAAA TCGCCCAGCC CAAGGGAGCC CGGGGTATTT 
GCAACAGCGT GTTCATTTCC AGGTGCCTGT CACGGGTCTC CTCCCTGCTG 
CTTCTCCAGG ACCCATGATG AGATTATTTT TAAAAATTGT TTTTGGTCGT 
CTCCCCCGCC CCCTCCCCTT CTTTATTTTT TTCCTCTTCG CTGCACTCTT 
CTCGGCTTTT CCCCTGACAC TACTGATGGG GGTGCGGGGG GACGTCGGGG 
ATGGGGGTGG CCAGCGCGGT CCTGGGAGTG GCGGGTTCGG ATGGGCTGGC 
TGCGGTGGGC CACTTTGGGC ATCTCGGCGT GGCCTGCGCC GGGGTCACGG 
GGAGGGCTGT CAGCGCCAGG GCGGCGGAAC CCGAGGTCTC CAGACGAGTG 
AGGGAGGGAT GCAGGCTTGG GGGTGATGGA GCGCTTGGCT GGTGGCTGGT 
GAGCGTCCAT ACATCATAGC TCTCCTTCCC ACTCCCCCGC CCCTCTTCGG 
GATTCTCTCT TTCTCTTTCC CCGTCCTCAT TTCTTTCTTC CTTTACTCAC 
CACTCGCTTC ATTCTCTTCC TTCCATTTCC TCTTTTTTTC TCCCCTCATT 
TCC TTTTTTT CCTTTCCCTT TTAAAGAAAG GGGAATCGTT TGTAACCCTT 
TCGTTCTACC AACGTGGAAT AGCTGTGAAA CCTGCAGCGT GGTCACCTCA 
GCCTGGTCGT TTTCAGACCC GTCCTCATCC ATCAACATAT TTGTTTCCCG 
AGTCTATTGA TCTCCCTGAA TTCTACAGAA ATGCATTCTA AGCTAGGCGC 
CTGTATGTCA GAATCAGTTC TGCAGGTAGC TTCCGTGCTC CAAGTATGAC 
ATGTATTGTA AGGGCTGCAT CTGTTTTAAA CCCACATAAG CCATGGGTAT 
AAATAAATGT AGCTTTGAAA AAAAATCTGG CCTTATTCTA GATAAACTTC 
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3 501 CCTCTTAAAT TACTGATATA CTCTTCTCCC TCTTTGACAT TTAATTTTAG 
3 551 GAAAGTTGGG AGACAGGTTC TTGTCCTCCA GTTTTTAAGG AGCAGGCAAC 
3 601 TTCTATTATC TTAATTTTCT CGTCTTTGAA CATCACTCAC GTTTGCACTA 
3651 CCCAGTCAGT GGAACGAGTG GGTCATAATT AA 



Fig. 3 (cont 1 d 2) 
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r Pf"" 1 n" , T""PT"»Tl A rpo 

1 0 1 1 1 1 1A 1 C 


rrv"' tv pnim/^n tv h 

TGACTTCCAA 


TTTTGGTGTT 


CCCTGGGTGG 


D X 


1 Vjvjvj i J, 1 1L 


/Tpf" , A /"* 7v /™i 7\ rnm 

G 1 oALAlATT 


TAC AAG ATGC 


TTTTGGCAGG 


TTGGCTGGAA 


1UJL 


mrnrn*-* -p. 7. r^r>c* A 
111 


CA1 ITAATTG 


nr. TV /"•/^m/** TV Tv m 

TAGGTGCAAT 


AAAATATTCA 


TTTTCTCTTG 


J. D J. 


1 1L1 XGG 111 


GAGATGTCAT 


GCCCTTTTGG 


TCACTTATAT 


TTTGGTGTGA 


ZUi 


Vw- IAj Hjlblbl 


G 1 G 1 GTATGT 


GTTTGTGTGA 


AGGATTTAAC 


AAAGTCTGTT 


z ox 


C I AACTGTC A 


TG TG ATTTG A 


AGTTAAAAGG 


TATGTTAGTG 


ACAAGCCACA 


■5 ri i 


AATTTC TC TT 


ATTTATAGTA 


CATTGATCCT 


GAAACCATTT 


TTTCCCTTGT* 




GATTTC TTC T 


mo /—i tv m/"^ /1 tv m 

G TG C ATGG AT 


CATTTAACGA 


AAGGTTGGCA 


ATGATGAGCT 


A fl 1 


Al 1 1 1 1 I 1A1 


7\ 7V m TV ✓""»/""* TV TV * TV 

AATAGGAAAA 


AAATTCCTCA 


AGTTTACTTA 


CCAAGTCATA 


jI 


rpntiiuprn 71 rp tv /-^t t\ 
1111 1A1ALA 


/"* 7\ Z" 1 O TV f 1 \t 1 1 TV 

GAGGGATTAG 


CAAATATTTC 


TGATC TAATA 


TTTTAATAGA 




C IajAAI XGCT 


G AC C AC TGC T 


AATTACCAAG 


AATATATTTT 


CTTAATTCTG 


bol 


AAATTGC TG T 


ACCTCTCAAG 


TTGTCTGGAG 


GACTCCAAGT 


GACCCAACTT 


601 


GTAACTCATG 


GCAACAGGAA 


GTGGTTGTTC 


TGGGTGCAAG 


CTGAAGTGTG 


651 


CACATGGACC 


CGTACTTTGT 


TAGCACTCGG 


GGACTTGATA 


TGGAAAGAAT 


701 


TAATGTACTG 


GCTTTTTTGT 


ATAGATGAAT 


GTTAACTTTC 


TGACATTAGT 


751 


CAGAACTACA 


TCTCCCAAGC 


CTTGTTTTGC 


AGTGTCTGTC 


CCTTTGCTCT 


801 


TCACTTACAG 


TAAGTCCTTA 


CTTAACTGAC 


TTGATAGGTT 


CTTGGAAACT 


851 


GCAACTTTAA 


GCAAAAGGAA 


GTATAATGAA 


ACACTTTTAT 


CACAGGCTAA 


901 


TTGGTAGAAA 


CAAGACTTAA 


GTTCCCATGG 


CATATTTCTG 


GTCACAAAAA 


951 


C ATTTC C AAA 


CTTCTCAAAA 


CACTTCAATA 


TTAAGCATTC 


AAATACATGT 


1001 


TV t\ Tv y*im tv rTYr—im TV 

AAACTATGTA 


TATATGTAAG 


AAAGGTTACT 


ATAAACCAGA 


TCAATATTTA 


XUbX 


ooo'Tv Ts mm tv miTi 

CCCAATTATT 


T AAG TTC AGG 


GTCTTAGGTG 


GCTGGAGCCT 


ATCCGAGTAG 


1 i m 

11U1 


v- I CAGGGCAC 


AAGGCGGGAA 


CCAGCCCTAG 


ACAGGACACC 


ATCCTGTTGC 


1 1 CI 

1131 


/VGGG(_A\_G1 1 


LACACATGCC 


CACACGCAGG 


CTGGGACCAT 


TTACATGTGC 


i o m 


v-AAl ItAULi 


AC C ATGC AC A 


TCTTTGAGAC 


GTGGCAGGAA 


GCAAGAGTAC 


i *? i 


PTVPSPS AAA 


rp^~«f~i tv rpiv pun* 

1 C CAT AC AG A 


TATGGGGAGA 


ATGTACAAAC 


TCCACCCAGA 


XJUJL 


\~t\\j 1 GGAL.CC 


AOL cagg aat 


CAACATTTGG 


GCAACATTAT 


AATGAAACGA 


1 -SCI 


AGTTGAATGA 


tv tv my~i TV m/^mr*iA 

aatgatgtcg 


TTCCACGACC 


TGCTGTACTT 


GAGGGGTGTT 


X ft UX 


A1AAAA1 it- 1 


71 /I TV TV /^l TV /"l TV 

cagaagacag 


AGGTTTAATG 


CTATCTTTTT 


AATAGAAAAT 




7A A /^>T>rp a rp t\ 0 A 
AAL 1 1 A 1 AGA 


/^v tv tv rno ^"i tv 

GAAGTGTGCA 


CATGTGACTT 


TGTGTGTAGC 


AGGAATCATT 


1501 


AGGATGAGAA 


TCAGACGTAA 


GAGGTGGTGC 


CAACATGAGG 




1551 


ttcagggagc 


TGTGGATGGA 


AGTAGAAGCC 


AGAAGGCCAG 


GGTTAGGTTC 


1601 


CTACTTCTTA 


CTGTTTCAGT 


TATTGCAGTG 


TTGGCCTGTT 


TATTCACAGA 


1651 


TGTCACCTAG 


C TTTGTTTTC 


TCAAGAAGAA 


AAATGAGCAT 


AATCTTTCCT 


1701 


GTTATGAATT 


CTTAAACACA 


CAGGACATAA 


CCACAGACAC 


AGAGGTGCAC 






Fig. 


4 
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1751 ATATGTAGCA GTAATGGATA CTAAATGATA CACTCGGAGG AAACAGAAAA 

1801 GACTTCTGAA TAGAGACTGG AG AT AC TTCC TTGGACCATT GATGAATGGG 

1851 CAATGATGCA TTTTTGTCTT CCATTCAGAA GGCTAATATA TTGCTCTCTA 

1901 TGTTCTATGG ATAAAGGCAG TATATGCTCA AGGATGAATC ACATAATATG 

1951 CATAATAAAT CCAGCAAGCA TTACCCTTTT ACTTATGTGA CTGCAAGTAG 

2 001 GAATACATTT CCCCCACTCT AACCATGTAA GATTTCTTTC CCTTCTCCCA 

2 051 TTTTGTAAGC AAAAGTAAGT TCCTGAAAGG TTAAATGGAC CTCAGGATGG 

2101 GAAAAATCCC CAGAGCTATC TTTCTGCACA GACTTCATTT TTTCTCCCAA 

2151 GTCTGACTGT CAACTGCGAT ATCTGATATG AGGCTCTGGT GCTGATGTTT 

2201 CCATAGGTCA TCATCCTTCG GTGTCCCAGA TGAAGTCTCA GGTCGAACAT 

2251 TGCAATAGCA CAGATTCTGA ATTTAATGCA TCATTAAAGT TGGTTATGTA 

23 01 ACCCAATGGC CTTGTTAAAC TCCAGATTTT TAAAATTATA TGTATTTACT 

2351 ATTCTCTTAT TTTAGAATGA TCTCACAATG TTCACAAGAA AT AAGCC C AG 

2 401 TCCCTGCAAA GACTTTAAAA GCTGCTTGTT CACATCATTA GATTGTACAA 

2451 CGCTTGTACA ATGACACTTT TTGCTAATCT ATGCAACATT TTTGTAACAA 

2501 TTGTGCACAT TTTAACTACT TCAGATAATC AGGACCTAGA GACTTCAAGA 

2 551 TC TGGAAGC A TTGCTGGTGA CATAGAGCAA AAACTTTCTT GAGAATAGGA 

2 601 AG TC AG TG TT TTGACAAGTG ATTTATAACA GTTCAGGTAT AGCCAGGAAG 

2 651 GTTTGAAACA AACCTTAAGT ATTATTTCTT TCATCTTGAT TAGTATATAT 

2 7 01 TTATATGTGA TCTATTTATG TATATTAATA GATTTTTGGG TCTTATAGCC 

2 751 AGCTTTCATT TTTCTCTATT GGAAAAGATC TAAGTCCCCA TCCTTCCTTG 

2801 GTGGCTTTTG GTAGGTTTGT AGACAAAACA TTGAAGAATC AATGGTACCT 

2 851 TTTATACATT AATACTGCCA ATATGACCAT AAAATCATAT TTTTTGGGAA 
2901 TTTATTCCCC CGATCAAAAG AAGCATTTGT TATTGAACAC AGTCTTATGC 
2951 TACCTTATTA AGATGTATCA AACACCCTGA TTGATCAAAA ACACCTCAGT 

3 001 CCATTTTAAG GCAGTATTGC CCAGCAATTA AAGATGTAGC TTCTGGAGGA 
3 051 GTCTTTCTGA GTTTGAATTC AGTACTC TTC CACGTACTAT ATAGGTGATC 
3101 TTGGGTAAAC TTCTTGAGTC TCAGTATCCC CATCTGTAAA ATTGTTGTAG 
3151 AGAAGAATTT TTGTGATGAT TAGGTGAGAG AATATATTAA TGTAATATTT 
3201 AGGAGAGCAA CCAGCATGTA GCATATATTC ATTACATATC AATTTCTATA 
3251 TTATTGATGT TCATACTGCT GATGTTGAAA TGCACAGGAA GGCCACAGTT 
3 3 01 ATTTTCTGTT TAGATTGATT TTTCTTTTAA AGTCTGAACA TAAACTGTAA 
3 3 51 TACTGTGCTT ATTT ATG TAG GAACTGTGAT CTCGTCTCCT CCTTTTCCCA 
3 401 TCTCCCCCTC TCTACCTTAG TTTTTCCTTA TAGTCTCAAG CTGAAAACAA 
3 451 TGACCAGGTG CCTAAGAGAT AAGAATACTC TTTC TTTTG A ACTCATGGCA 

Fig. 4 (cont'd 1) 
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3 501 TTAGCAGTGA CCTGGATGAG ATTGGAGGCT ATTATTCTAA GTGAAATAGC 

3 551 TCAGGAATGG AAAACCAAGC ATTGTATGTT CTTACTTATA AGTGGGAGCT 

3601 AAGCTATGAG GATACAAAGG CATAAGAATG ACACAACAGA CTTTGGAGAC 

3 651 TTGGGGAAAG GGTGGGAAGG GGGTGAGGGA TAAAAGACTA CAAATAGGGT 

37 01 GCAGTGTATA CTGCTTGGGT GGTGGGTGCA CCAAAATCTC ACAAATCACC 

3 751 ACCAAAGAAC TTACTCATGT AACCAAACAC CACCTGTTCC CCAGTAACCT 

3 8 01 ATGGATATAA AAAAATTAAA AAAAAGAAAA AAAGAAAACT CTTTTTTGCA 

3 851 GGGGGCAGGT AAAGGGTAAG AGGGCATCCC ATTTTTGAGT TTCTAGAAAA** 

3901 GCTT 



Fig. 4 (cont'd 2) 
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Human genomic sequence 



1 


CTGP.AGGAAG 


CAGCAGCAAG 


GTC CAGGG AG 

VJ X X*- w Wx*\J 


CC TCT AATTT 

X Vrf X ^i-tx XXX 


AAATAGGAGA 


51 




TTAACAGCAT 


TGACAAAAAC 


AAGCCTCCAA 


ATTATGCAAA 


101 


TGGAAACGAA 


AAAGGTAAGT 


GTTTGTTACA 


TCATTATGAC 


ACAAGTCCAA 


151 


C ATGAG TP TT 


GTGAATTGCA 


TGCTAAATCT 


AATATTTGAG 

**** X ** XXX W**VJ 


C AGCG TAAC A 


201 


ACTTTGGGCC 


TAGAGATGTT 


ATCAGTGGAG 


TTTPTTTATG 

XXX V— XXX **■ X VJ 


TTTPPTAAPT 

xxx v_< v_- x fvn\> x 


251 


GTCCPCTPPT 


GAC TGC C AG C 


TTTCTTATPT 

XXX Nw> X X T\ X V X 


GAAGAAPATT 


TTA A AP A A AT 


Jul 


x>/vri.\«. X v»A X X v_ 


ATTTTA A AG T 

X X X X AnA\3 X 


AGTTAGTTAT 

*\\J X X X X X 


ATATGPA APT 


AP A A ATAPTP 




mrpmpmp a A A A 


APAGGTPPTT 


PPA A ATGPAT 


PTA A ATPAPA 


XXX IV- X lAlu 






rnrrxmrnrnr^ A A A A 
11X1 luAAAn 


TPfpaTppTpa 

lulAlLLi VjtA 


AA 1 v_. A 1 AAAlj 


LLA1A1 1 CjAA 


/CI 


11 1 A IV 1 \j AA 


1 UL. 1 1AAL 1 1 


r-i r prnmy, a fT-""p 
UALa 1 1 AAL7L7 1 


7> XpTlpppTi rp> 

AAbAbL C- A 1 A 


AGTGTTTTTG 


501 


ACAATTAAGG 


TTGGAGCATC 


AAAATTTGAA 


ACATAATTAC 


AGTAGGTTTT 


551 


TATCTTTGCA 


7\ f-*t~* 7\ /"*»px A O A m 

AGCAGCAGAT 


C C C AG AG AT A 


TT ATG AC C TC 


AGTTTTCCCC 


601 


AAAAGACAAA 


TTATTCATAT 


TTGTTTTGTT 


minpftwTip a a mm 

TTC TTG AATT 


AGTGCATAAT 


651 


ATAAATATCA 


A. A rn/*"* A ^ A A. A A 

AATLALAAAA 


TCAAGGACAT 


mn ti t\ mp aaa p 

TAAATGAAAG 


TGTCTGTTAA 


/ Ul 


AGGCATA1 1 A 


rp A A A nrv~" A A t»o 
1 AAA IbAA 1 L. 


7i m> >iPppA a 
A 1 AAt^L. UAL. A 


p A mrnpimr"T>P 

CA(j lTLTLTG 


TGATG TACGA 


TCI 

/ DI 


Atj 1 (jtjtjrL- A 1 1 


1 AA-ALjA^vj 1 V7 


O'TV™* i mf PfPP ft rp 
V- ITjA 1 1 iuA 1 


^jL- 1 Ibl L. A\_ 1 


PXPTITAPPTVP A 

1 AbLAGA 


O Ul 




A TP A PT a HPPT 


\j X riu X x 1 


PPTP A A TP A m 
LL1 \— 1 X 




ft R1 

O J X 


ana apttptp 


PTGTTATAAG 


TAGTAT^GPT 

X AO X X UOv X 


PTPTPAPPA A 


PTAPPPTPT^P 


9 01 


P TG P TGG A TT 

* — X Uv X vOA X X 


TTGAGGAAGT 


£%. X X X X X 


PA AT AG A APT 


TP AAA PTTTT 
X U/viriL X X X X 


951 


C TTP AG AGTG 


TTGGGCTCTA 

X X VJVJVJ\— X w X XT 


CATGGAAAAA 


PAP ATG AAAT 


TA AAA T^<WO.O. 


1001 


CACAAATGTT 


TAGTTAGTAG 

X iiVJ X X X X*NJ 


MCATCTGGC 


T A ATTGGG AT 

X flil X X X 


PA A ATA ATTP 


1051 


AACCATGTGG 


GAACGTTTTT 

VJxMiVw X X X X X 


GC TC AAAATA 


GATAATTGTG 


A A TTGTTTP A 

AA X X \3 XXX V— 


1101 


TATAGGCAAA 


TGATTAGACA 


A PTTPPTP TT 
x xv'V^-xv^xx 


PPTP AAA TGT 


PA APGPAPAP 


1151 


ATG TGATC T A 


GAAGCAAGAC 


A PTPTTTTGT 

/i^. — X \— X X X X \J X 


GTA A ATATTP 


nnrprpmpppprp 

V- V- XXX VjVjv,V_ x 


12 01 


A A AGP A A A AG 


TGGAPAGAPT 


TTA A APAPPT 
X X AnAV.nU L X 


GAP AGP AG AP 


P A P T*P TP T»P T 1 


1251 * 


TAAGATTGPA 


ATATPTTA AP 


OAO X X 


A A ATPP A A A A 


*T*P AAAAAPAa 


13 01 


A AGTGTATAT 
ririvj X vjr X a X r\ X 


TTGGAAGTTA 

X X OVJj.rt-tt.\J7 X X A 


r^r; a A Tf^TTTT 

VjfUflftJ. Vj X X X X 


prprprriA t\ A A rpA 


TAAAATAAAa 


13 51 


TTTTAP ATTT 
X X X X r\\Jr\ XXX 


A Af^ ATP ar A A 


r;aa ATATTkP 


tp a anapTTa 


rp a prppi iwnpipm 
iAL 1U11 LL 1 




GGGGCTAAGG 


G AGGTG AC AG 


TCGCTCATCA 


GAAAAAAAAA 


AATGCCCTCA 


1451 


TTTCCTAACT 


TTTCTAAAAA 


ATATAATACA 


AGTTCAGGCT 


AATACTTCCT 


1501 


GTATATGTGG 


GAAATTTCTA 


GGGGAAGCTA 


ACAGGCTTAG 


AAATAAAGAT 


1551 


GTGTTAAATA 


GACTACCAAA 


GTGTCCAATT 


AAGCAACACG 


ATACCACCGT 


1601 


TATTGATATT 


CTAGCAAGAA 


ATTACTAGCA 


ATGTTTGTAA 


ATAGACTTAG 


1651 


AAATGCATTT 


GATGAATTAA 


CACTTTTATA 


TCTTAATTTA 


TCTGAATTTT 


1701 


TCTGTAATGT 


GAAAATGTTT 


TATTTAACTT 


ATTTC TGGC A 


TCTATTAGTA 






Fig. 


5 
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1751 


AAATTCTGAT 


GATATACAAG 


CATTAATATT 


TTTCCATGGC 


CACTCAATTC 


1801 


ATACATACCT 


TCCCTATCTA 


TGCTTAGAAG 


GC AG TGC AAA 


ATTAGATAGT 


1851 


AGCAATATTG 


ATTATAACCA 


CAAGGTGGAG 


ACAGATGTCA 


TGTAATATGC 


1901 


AGTCTGCTCA 


TATAAAGCAC 


ATTTTCTTAG 


ACAAGAGTTT 


TCATACGATA 


1951 


TAATAAAGAC 


ATCTGGAATT 


TGTCTTGTAT 


GCAATATGAA 


ATTTGCTATT 


2001 


AAACGTGGAG 


TTAAAACTTT 


ATGTCAATAG 


ATCCAATAAC 


AATGTTCATA 


2051 


AATTAATCAT 


TATGTCATGC 


TGTATTTCCA 


AAATACTATC 


TTAAATTATA 


2101 


AGAGCAAACG 


AGGTAATAA 









Fig. 5 (cont'd) 



59/ 124 
Human genomic sequence 



1 


GTACATTTTT 


TAATAAAGAT 


GTTTGTTTTA 


ACTTTTTGAA 


TATGAAGATT 


51 


TCTAGTTCTA 


GAATAATGTT 


TATAAAAATA 


TACAAATCCA 


TCTGGTGATG 


101 


AGTTGACCTC 


TATCACAACT 


AGTTTGCATA 


TATAACTTGG 


GTGTGACCAA 


151 


GCAAGGTGAG 


AGTTAAGAAC 


TTTTAAAACT 


TACTGTATTA 


TATTGATAGA 


201 


ACTCAGAAAG 


TACTAACTTG 


AATATTATTA 


TTCTAATTGC 


TTTTCCCTTT 


251 


TAGTTATTAA 


AAATAAGAAT 


ACTTAAATTA 


ATAACAAGAT 


CTTTTACTGG 


301 


CAGGATTAAC 


CAAATTATCT 


GTAATG TGTT 


CCTCGAATGC 


TTTTAAGTGG 


351 


AAATATACTT 


TATACATTCT 


TTAACAACTC 


TGAGAGGATG 


AGTTACATAA 


401 


ATCAGTTCAG 


GAATCTATAG 


AATCTGTAAT 


ACATAGTAAA 


GGTTTATTCA 


451 


CAATTAAAAC 


AATTTCACTT 


CTATATTAAA 


AAAACAAATT 


GTTGAAAGTA 


501 


CAGTGGCTTT 


TCATATGTAT 


G ATTTG T AAA 


ACAAATTAGC 


TTTTTTAAAG 


551 


TGATGTGACG 


CTTAATGAGA 


AGAAATCAGT 


AGAGAATTAC 


AAACTGCACT 


601 


TCAAAAGATA 


CATCTAATAT 


CATTTTAATA 


ATGAAATTTG 


AAAAAATAGT 


651 


GTGCTCGTTT 


TACAGTCTCA 


TTAAATGAAT 


TAAAATATCA 


GCACACATTG 


701 


TAGTAGGTTA 


TCATTGGCAG 


AGAAGGCTGA 


AATAGAAACG 


TTACAATGGG 


751 


ATGCACTGCC 


ATC TGAAC AT 


TATGTCGAAG 


TGGAACGCGG 


AAACATATTT 


801 


CTCAGAACAA 


GTGGTAAAAT 


GAAAACAGCA 


TCATTTGTAA 


AGCATTTCTT 


851 


TTGAGAGTGC 


TTCAGTTTCT 


TCTCCTGATG 


ACCTGCCATT 


CAGAAACTGA 


901 


CAATGAATAA 


TACACTCTGA 


CACCAGCATT 


TGTCAATTTG 


CCCAGAACCA 


951 


TATGAGAGTA 


CTCTAGACAG 


ATATATGTTC 


CGAAGTAAAC 


CGAATACCTG 


1001 


TTAACTGTAA 


ATCAAATCTT 


GTAGAAACCA 


TGCCATGGTT 


CCTTTGGACA 


1051 


TATACTTTGC 


ATGCCTGAAG 


CAAGTTACCT 


TAAGAAATCA 


TTCTTTTGTT 


1101 


TTACAAAACT 


TGTATTAAAA 


AATTAAAAAT 


GCAAAAAAGC 


TTAATATTAT 


1151 


TAGGAATTTA 


TCCATAGCTT 


TATTTGGAAT 


CCAGTTTCTT 


TATTATGATC 


1201 


TATAAACATG 


CATCATTTGA 


TGGAGTTCCT 


TAGTGGAGAG 


GTGTTTTTCC 


1251 


ATGTTGCTAA 


GAAACATGCC 


CCAGCACCAG 


AAGGGATACT 


ACCTACCATC 


1301 


TTTTTGCCAT 


TTCTCACCGT 


GATTCTTACA 


TTGTACCTGT 


TTACTCACTG 


1351 


AACAGGGCTT 


CCTTCTCTTT 


GTCTAGATTC 


TAATCAGGTG 


TCTTCTGGTG 


1401 


TGGAAGCTTT 


GGCTTTTATT 


TACACACAAC 


ACAGAATTAA 


TAAGATAGAT 


1451 


GCCAAGGATT 


TAGCAACATT 


TTAATTCAAC 


ATTATACAGG 


TATCAGAGTT 


1501 


AATGAGAATT 


ATGCATTAGT 


CTTTAAATTT 


GGGCAGCTTA 


TTCAGCTAAA 


1551 


ACATAGATGT 


CTAGCTCTTA 


AACACTTTGT 


TTTTTTAATT 


ACTCTGAAAT 


1601 


TACAATAAAG 


TCAAAGAACT 


GAACTGTTTT 


CTTTTCAAGC 


CAGTGCAAAT 


1651 


GTGCTTTAGT 


TATTATTTTA 


CTGGTGATCT 


AATTATGCAT 


TTTAATGCTT 



Fig. 6 
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1701 


TATTACTTAA 


TACTTATATA 


AGCCTAAAAT 


ACGTTGTTAA 


TGTCATAATT 


1751 


TCAGGGATTT 


TAGTATTC TT 


TCCATGAGTT 


ACCATAACTA 


GGTGCATATG 


1801 


TGTAAATATA 


CGTATATATC 


TATATCTATA 


TATTTATATC 


TATGTATATA 


1851 


TCAATTTATA 


AGACTAAATA 


GACTTGGCCA 


TATGTGTTGT 


TGGTTTATGC 


1901 


ATACATGCAC 


AAATATTGAG 


GTGTCCACAA 


AGTATATATG 


CCTGTACATA 


1951 


AATTACATAC 


TGGCTGGTGA 


GTGAATGTAA 


GCTTCTCTAA 


ATTGTACAAC 


2001 


TCTCCACAGA 


GTGGCACTCT 


AATATTGCAA 


AGGTACAATA 


TAAGCATGTG 


2051 


CAGAATGAAC 


AGCTCTTCTA 


GGATCCCTAT 


AAAACTCCAC 


CCCATGTTTC 


2101 


TGT 











Fig. 6 (cont'd) 
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Human genomic sequence 



1 


AAGL. 1 1 L- A I L- 


L-L. AVj LsOo L. 


•k r"v r rczr t c z. c a 


1 oL-L- X 1 AO 


AOL- 1 L, 1 L,L 1 G 


bl 


rpTv TV 1 TA/T" 1 7A /^T 1 








Ks 1 L. AGGAGGL. 


101 


ACGGGGG 1 LA 


GGGALL.L-AL. J. 


1 VjAVjVjAVsoL i 


pmnmpmpp/ifTi 

G 1 1, Ibittt I 


1 AGCGG AGG T 


151 


AGAACACTGT 


GL, 1 LbbAbA 1 


LL-LtL- XAjtV_ 1 V, I 


LI i LAbAbL I 


GGCAGGCAAG 


201 


AGTGTTTTAG 


TCTGCTGAGC 


v- rGCGCCCAC 


AGCCGCCCCT 


TCCCCCAGGT 


251 


GCTCTGTCCC 


AGGGAGATGA 


GAGTTTTATC 


TGTAAGCCCC 


TGACTGGGGC 


301 


TGCTACCTTT 


CTTTCAGATA 


TGCCCCGCCC 


AGAGAGGAGG 


AATCTAGAGA 


351 


GGCAGTCTGG 


CTACAGCAGC 


TTTGCCAAGC 


TGCAGTGGGC 


TCTGCCCAGT 


401 


CCAAAATTCC 


CAGCGGGTTT 


GTTTACATTG 


TGAGGGGAAA 


AGCACCTACT 


451 


CAAGCCTCAG 


TTATGGCAGT 


TGCCCCTCCC 


CCCACCAAGC 


TCCAGGGTCC 


501 


CAGGTGTCCT 


TCAGACTGCT 


GTGCTGGCAA 


TGAGAATTTC 


AAGCCAGTGG 


551 


ATCTTAGCTT 


GCTGGGCTCC 


ACAGGGGTGG 


GATCCACTGA 


GCTAGACCAC 


601 


TTAGCTCCCT 


GGCTTCAGCC 


CCCTTTCCAG 


GTGAGTGGAT 


GGTTCTGTCT 


651 


CACTGGCATT 


CCAGGTGCTA 


CTGGGGTATG 


AAAAAAAAAA 


CTCCTGCAGC 


701 


TAGCTTGGTG 


TCTGCCCAGT 


TTTGTGCTTG 


AAACTCAGGC 


CCTTGGTGGT 


751 


GTGGACACCC 


AATGGAATCT 


CCTGGTGTGC 


ATGTTGTGAA 


GACTGTGGGA 


801 


AAAGCATAGT 


ATC TGGGCTG 


GAT AGC TCCG 


TCCTTCAAGG 


CACAGTCCCT 


851 


CATGACTTCC 


CTTGGCTAGG 


GG AGGG AG TT 


CCCCAACCCT 


TTGCACTTCC 


901 


C AGGTG AGG C 


AACACCCCAC 


CCTGCTTC TG 


CTCACCCTCT 


GTGGGCTGCA 


951 


CCCACTGTCT 


Tt Tv TV /"l m/ 1 ! Tl ji^I 

AATC AG TC AC 


TG TG AG ATG A 


GCCTGGTACC 


TCAGTTGGAA 


1001 


ATG C AG AAAT 


CACCTGCCTT 


CTG TG TTG AT 


CTCACTGGGA 


^1 f \ TV Ta j^I TV ^TfTl 

GCAGCAGACT 


1051 


GGAGCTGTTC 


C TATTC AGC C 


ATCTTTCTCA 


z^i m/^ tv m tv tv m/"** 

GG TC AT AATC 


ATAGATTTTT 


1101 


AATTG ATC C C 


TV /"» TV TV TV ITI/^I ^ 

AGCAACATGG 


ATTAGTAAAC 


tv /i /""i tv m tv rrfcmm/^t 

AGCATATTTC 


CAAGTGATTT 


1151 


TTTTTTATTT 


TAAGGTCAAA 


TC TAG AAAAT 


ATTATAGTGT 


TATCACCACT 


1201 


TAAAATTATT 


AC TGG TG AT A 


CTATGTTTGT 


CTC TATTC AC 


ATTTTATTGC 


12 51 


m TV TV TV TV /"^ TV TV fTl 

TAGAAAGAAT 


TATAATTTGT 


AGATAATAAT 


AGTTATTTGA 


AATGTATTAC 


1 J vl 


AT ATC C TTTT 


Tv /^immmm tv tv ta 

ACTTTTAAGA 


TV TV /^i m^—* < TV *"»m 

AG AGG TG ACT 


TAATTATC TA 


/"*f*m Tv m tv j—i Tv tv m 

GGTATACAAT 


1351 


TATTTTGAGG 


ATACTAAATG 


TCATGAATAG 


CAAATTTATC 


ATATTGCTTT 


1401 


CCTAGGTGAA 


GACCCTGAAA 


CAAGAAGAAT 


GAGAACAGTT 


AAAAACATAG 


1451 


CAGACTTGAG 


GCAGAATTTA 


GAAGAGACTA 


TGTCCAGTCT 


TCGTGGGACT 


1501 


CAGATAAGCC 


ACAGGTTTTT 


TTCAATTTTG 


CATATATTTG 


AGCCAATAAA 


1551 


GAAAAAATAA 


TTACAAACAA 


ACATTTAACT 


TTTCTTATAA 


TGACAGAGAT 


1601 


GGGATTTCAG 


TTTCCCCTTA 


CTATTTTCTC 


CCTTGTTTTA 


TATCAAATTG 


1651 


ATTGGTAATT 


ATCCTTAAAC 


TGAGAATTCA 


C AG TATAT AC 


CTATTTATCT 


1701 


TTTATCTCTA 


TCTCTATCTG 


CTATTTATGT 


CTTTTTCAGT 


ATAATTTCCA 



Fig. 7 
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1751 


GTACTGCAAC 


TACCACCATC 


ACTGTTAAGT 


GGATTTGTAA 


TACCTGTCCT 


1801 


AGAAAACAGT 


GGCACAAGTT 


GCACTTGAAA 


TGCATCTGGG 


CAGGGTAGTA 


1851 


GGGAGACATT 


CAAACATAAT 


TGTAGTTAAC 


TTTCAGAATA 


GGTCTGGGAA 


1901 


GGTTACAGTG 


AGTTAAGGAT 


TTGTTGAAAA 


TGTAAAACAA 


TATGTTGTTT 


1951 


TACCCAAGGT 


GTACTGATGG 


CCTTTCTTTT 


GAAAACAAAC 


GAAAAGCTAT 


2001 


AAAATGTATG 


CCCCTTTCCA 


CAATTTGACC 


TCAAAATGAA 


TATAGAGTTT 


2051 


AGCTTTCGGG 


AAGATGACGT 


GTTTATAAGA 


GATGACCCTC 


AACTCCAGCC 


2101 


TTTTCTGTCT 


TCATGCATTC 


TAGATTATGG 


CCCTAAGTGA 


ACCAGAGTAT 


2151 


AG TTATTTCT 


CCATTTTATT 


TGACAGCACC 


CTGGAGACAA 


CATTTGACAG 


2201 


CACTGTGACA 


ACAGAAGTTA 


ATGGAAGGAC 


CATACCCAAC 


TTGACAAGTC 


2251 


GACCCACCCC 


CATGACCTGG 


AGGTTGGGCC 


AGGCATGTCC 


GCGACTTCAG 


2301 


GCGGGAGATG 


CTCCCTCCCT 


GGGTGCTGGC 


TATCCTCGCA 


GTGGTACCAG 


2351 


TCGATTCATC 


CACACAGACC 


CCTCGAGGTT 


CATGTATACC 


ACGCCTCTCC 


2401 


GTCGAGCTGC 


TGTCTCTAGG 


CTGGGAAACA 


TGTCACAGAT 


TGACATGAGT 


2451 


GAGAAAGCAA 


GCAGTGACCT 


GGACATGTCT 


TCTGAGGTCG 


ATGTGGGTGG 


2501 


ATATATGAGT 


GATGGTGATA 


TCCTTGGGAA 


AAGTCTCAGG 


ACTGATGACA 


2551 


TCAACAGTGG 


GTAAGTAACC 


CTGTTCTCCG 


TCAGCATTGT 


GTGAAGAGGG 


2601 


GAGGTGGTCT 


ACTATAATGC 


ATTCACTATA 


AACAAATGTG 


TAAGTTTGCC 


2651 


CAGAAAGTCA 


TGAGAACATA 


TGAGATATCT 


GAGGTTATTC 


AGAGTGTTGA 


2701 


AGGGCCCTTC 


CTCTGCTCAT 


TCATGGAGAG 


TAAAGAATCC 


AAGATTTCTA 


2751 


TAAATTCATT 


ATAAGCCGCT 


AAGTTTTTCT 


GTTGTTGAGA 


GAAACACATG 


2801 


TGGCTTCTGT 


TTTTCAGAGT 


GATTTTCACA 


TGCTTCTTAA 


GTAACAGATT 


2851 


TTGTAGTTAA 


GGACGTGGGA 


AGGAGACAGG 


AGGAGTTTTG 


CTGATTTGCT 


2901 


TGATTTTTTT 




AGCTTGTTAG 


AAGCGGCCTG 


TAACTGCTTT 


2951 


GAGAAACAAA 


TATTTTCTTA 


CTGTCTTCAA 


TTATGCATCC 


CCAATTTAAC 


3001 


TTGAGGGAAA 


AATCACTTTG 


GAGTTGAAAG 


TTTCACTCTA 


TTCATTTTCT 


3051 


TTTGATGGTA 


TCAGATTTCA 


ATACATCTCA 


GACCCTGTTT 


TTCTTCTGTG 


3101 


TCCTATTACA 


TTCCAAAACA 


TGTTGTGATT 


GTAAAACTCT 


TAGAGTATAT 


3151 


TAACAATTTG 


GGATATTTGG 


CATAATCAGA 


GAATAGGTCC 


AAAAGGAGGC 


3201 


AATAGGATAT 


TCTATTAATA 


ATTGTAATTG 


CCATTTTTAG 


CATTTCCTGT 


3251 


TATGTACTAT 


GCTCTTGTCA 


AGTGCTTTGA 


AGATAGTGTT 


TT AC TTTTCC 


3301 


TTCCCACCAC 


CAGCAATGTT 


TATGAGGTAG 


ATGTTTTTAT 


ACATGTTCTA 


3351 


TGGATAAGGA 


AACTGAGTCT 


AATTGGCCCC 


GGCTGGGAAC 


TAACGCTAGG 


3401 


GAAACGGCAG 


ACCTGCATTA 


GAACTCAGCT 


ATGTCTGACT 


TCAAACACAG 


3451 


GCTCAGTAAT 


ATGTGGAAAA 


GCTTCCCAAT 


TAACTTTGTC 


TATAAACTTT 



Fig. 7 (cont'd 1) 
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3501 


GTGTGAGTCT 


GGATTTTGAC 


TTAC TC TTTG 


TPTTTA PPP A 




3551 


CCCATGTAGG 


AAATAATTCT 


TC TATATAAG 

X X *X X x*i*Vj 


TP. A P P P T T P P 




3601 


CATGAAAAGC 


TTATGTTTGA 


AGGGTGACAP 


PAPPfA A A A A 


AljAij 1 ALAAA 


3651 


ATAGCTTTTG 


ATTACATTTA 


TAGCTTTGPT 

X xxx X 


PTP A f A TPPT 


AAiALL J. ACT 


3701 


AGTCCATTCC 


TGGTATCCAC 


CCTAPPTGAP 


x X X V— iftAAAA 


i I IAGAATTA 


37 51 


TAGAGACTAA 


TTATGATTAA 


x x iXtWDri. X 


x x\j x lul I C A 


taTTGCCACTG 


3801 


G ATTC AG AG T 


GPPTAGTTTP* 




A J. IVACTATC 


TGTGGACCCC 


3851 


TTCGGAACCT 


AAPPTATPPA 


i\r\ x X r\\3 X X X X 


rG TC ATC TAG 


AATAAGGATA 




a a attptapp 


rl±A_ 1 x LAlbA 


AvjX 1GTTAGG 


ATCATCCACA 


AATTTTAGTT 


3951 


TGCGCAATGC 


TTGGCATGAT 


ACAAGCACTC 


AATAAATTTA 


TCATCTTCCT 


4001 


CTTTATCATC 


ACTATTACAT 


TTATTATCAT 


TAATAACCAT 


ACCAATTTTT 


4051 


GGTTGTTGTT 


AGTTATAATT 


ATCATTTTTG 


TATGTATTTA 


ACATAGCCTA 


4101 


GGAGGCAATG 


CCCAGTTCAG 


AAAACATAAT 


GGCAAAGCAA 


GAGTGTCTAA 


4151 


GGCACACTCT 


TTCTCCCATC 


TCTCTCTTCT 


TTCTTCTCCA 


TTC TTTCC AC 


4201 


TCTATCCgCT 


CTTCTCTTTT 


TTTTCTCAAT 


CTCCTTAGAT 


GTGGACATAT 


4251 


GTGTGAATTC 
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1 


TGTGGGTGTG 


GGTGTGAAGC 


ATGTGTATGT 


GTGTGTGTGA 


AGCATCTP CC 


51 


CACCTGTAAT 


GTAAGTCCAT 


GAGTGCAGAA 


TTTTTGACAT 


ATTPTTT A PP. 

X X V_ X X X AW \ J 


101 


TGTTGAGTTT 


TAACAAATGT 


TTGTGGAGTG 


AATGAACAAA 


TTA ATP.A ATA 


151 


TAGGCTATTT 


ATTAATTAGG 


CAATATAGTC 


ACATAGGCTG 


GPA ATPP.P AT 
wLnn X A, X 


201 


CTAATTAAAT 


AGAGTGGTAA 


ATGAGTTCCA 


G AAAG AAC T A 


AP,P»TAPTAP A 
nv»\j X nL X A\_ A 


251 


AGGATGTTAT 


GAAAGAGAAA 


AATGAGTTAT 


GTT5AAA A ATA 


PP A CI 7A f™* 7a TV 

ovaAoAv_ Avj 1 \j 


301 


ATAAGAGGGA 


AAGAATCCCA 


AAGTGTGGGC 


PAPATTTTP.A 
\>nLn x x x X 


A APT 1 A 7A TV 7v r* 
AAL- 1 AA 1\jAL 


351 


CTATTATTCT 


ATT ATTG TT A 


GCTGAAAGTA 


P.AA A APf2TPA 


(TY^PP 7a r*t~*f^ 7\ 7v 

i\j^jljALstj(jAA 


401 


TATCTGCTAG 


TTTTTGG TAA 

X X X X X \J\J X 


A GG A TP»TTP T 


Vj A iuuLAuAA 


C C aAG AAATG 


451 


AACACAAGGT 


GACTTTGGTT 


Tanner a p a r^T 


A J. AA X L-A 


AC TCTCCTTG 


501 


CTCCATCAGG 


GCCCC AGAPT 


vjuvju x ^ lowL 


AuAvjuAAL 1 L. 


AGAACAACGT 


551 


AAAGACCTAG 


ATAGGT ATP T 


A ATA A ATTVn: 
■nAXAAAX lou 


tsAL-L. lo 1 (jAA 


AACAGTGCCT 


601 


CTTAAAGTGT 


GGTACCTGGA 


ppap^pacipap. 




AGCCATTGAA 


651 


ACTTCAT^GA 


AAGACAGATT 


c TP A nrTTP a 


1 LtAAbAL I I 


AC TG AATT AG 


701 


AATATCTCAA 


GGTAAGGCCT 


GP.T A A TPTP A. 

\JVJ X fin X \_ X VJJjri 


W?V_X I 1AAL 1A 


(jCCCTCAAGG 


751 


TGATTCTTAA 


G TTC AAGC AT 


P APT 1 AT ATT A 


A f^T"W A A r 1 7A 7A 
rtul XVjAAL-AA 


ATAGATGC C A 


801 


GGCCTATAAA 


TACATGTAAC 


GCCTAG C ATA 


AATATTTPA A 


P* 7a rnrp ?i > r > t\ m 
tAl 1AAAAAT 


851 


GACATTTCAT 


AG TTCTTATT 


TAC CC TATTA 


PtPTP* TPTTPT 

wv- X V7 X V3 X X v_ i 


prpp 7a 7A 7A T> > 71 


901 


TGAGAATATT 


GATATGTTAG 


AATACACTGA 


TGPAPTA ATT 
x vvnv> x nn x X 


X 1 1AAA1 IACj 


QC1 


A TV 7a 7A 7A T 1 7\ 7A T 


bACTTGTTAT 


ACC TGAAATA 


AATTGGTTCA 


GCTTGGTAGA 


1001 


TGCAGTTTTT 


GAGAATTATA 


TAAGTCATTT 


TTAAAAGAAT 


AATTTTAACT 


1051 


TGAGCTGCTT 


GCATAAATTA 


AATTGCAAAA 


AGGTCATAGT 


ATAAATCCTC 


1101 


CTATTAGCAG 


AGATAGAAGG 


TTTTTAAAAA 


AATTACAGAT 


AAGTCTGAAG 


1151 


GTCTTTTAAA 


ATCTTATATT 


CAGGAAGTGA 


CTCGGGATGT 


ATATCATTTT 


1201 


AAAATACATG 


GTCTTAAATG 


TTGTAGTTGT 


ATGACTCTTT 


CAGTTAATTT 


1251 - 


AAAAT AC TTC 


CTTCTATGAA 


AAATTGTTTC 


AAAAATTTTT 


CTAAATTCTG 


1301 


TTATCCATTT 


CAAGTAGGAT 


AGGCAAGAAC 


AGATATAAGA 


TACTACTTTT 



13 51 TTGTTCATGT TTACTAAAAA AAAAATTACT GTAATTGAGA TCATGTAAAA 

14 01 ACATGTTTCC TGTCTATTTG TCTTAACCTT TTAATCCTGG CACCTTAAAT 
1451 TTGACATAGT AGGAATTAGA AGACAATTGC AGAAAATGTC AACTGGGGAA 

15 01 ATTTTATTCT ACTAAAAACT ATGTCCATAC AACATAGCAA ATCACATTTT 
1551 AAAGGCCAAA AAGTCTTTCA TAGCAATTTT TCAGATTATT TTCAAAGCAT 
1601 ATCTTCTCTC TGCTCCTGCA GCATGCCGTT GATTTTTCTG TTATGC AG TC 
1651 ACATAAGTAA TTACATGTTT AC ATGTC TAT TTCACTCATA GAACACGAAA 
1701 CAGTTAAATG TAGAATAATA TCCAATCCAT CTTTTTATCA CCAGTAGCTA 

Fig. 8 
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17bl 


VjC A I AL lblrt 


bbAAL 1 Lnn 1 


A 1 X rp a rp a rp/~t t\ 
AAA1A1A1LA 


Vj A 1 AAA 1 1 Lj 1 


f^r* AAA m a a r**f~s 

GG AAAT AAC C 


1801 


ATA I CAGL. 1 1 


A mA A A rp A rp A 
A 1 AAtA I A 1 A 


CI 1 A 7A rpo A r* 
bAAA 1 La I bAu 


1 1 JLAAAAAbA 


AAACAATTAT 


1851 


TV O A rp A rp/"" AAA 

ALAlAlbAAA 


All rpi i ii 1 trprn a rp 
AAA! ill 1 A 1 


A PP A rnrpmrprrtrp 
AtLAi 11111 


1 AAAbALL 1 1 


1 C AGATGTC A 


1901 


I AL AG ill bo 


At 1111 L,L-ALj 


TT2' I "pip/^rri'TV'-irn 
loi 1 itllbl 


A 1 L- A 1 GAGAL. 


A A m A /"tin A f> H O 

AATAG TAG AC 


19 bl 


A J. lblAAAit, 


AAAAA 1 Ab 1 1 


rprp/*^ fT\f-if-*f~*f~\ rp rn 
1 1LJ. LxLiLib/ 1 1 


vj 1 \s 1 AL. A 111 


GAAAAAACTG 


2001 


a a rp a mp A rp a rn 

AATATCATA 1 


rp/^m rp/"*n*rp a O 
C 1 vjj 1 it J. 1 A\j 


AbAb 1 AA 1 bA 


rnpA A rp A mm A A 

TGGATATTAA 


CATATCAAAG 


2051 


G TAC AG AGAA 


G TCTT AAAGT 


TCAAAGTAAC 


ATC TGC TT AA 


TTGTATTTAA 


2101 


TTCAGTGCTC 


CATGAGCTTT 


TTTATCACTG 


ATTCCCTCCC 


TTTTTTCTCT 


2151 


TATGATAATA 


ATTAACTTGT 


TCCTGTAGCA 


TTTTAAGAAA 


TGTTGATTTA 


2201 


GTTGAATGCC 


TTCACTTCTC 


CAATATAATA 


GCAGAAACTC 


AGAAATATTT 


2251 


ATTTACCCAG 


AATCATGCAG 


CTAATAGTAC 


AAGGATTCAG 


GTCTTTTACT 


2301 


TCCTATTTTG 


TGGTTCCCAA 


CTACTTTTGC 


CAAAGGTCTT 


TTAAATAATA 


2351 


TGAAACATAT 


TAGTGATTGA 


TTCATTATAG 


TAAATGGGTA 


AATGATAAGG 


2401 


CTTGCAATAA 


TTCACTGACA 


AGAAAGCTT 
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Murine cDNA sequence 



1 AAGCCACAGCACCCTGGAGACAACCTTTGATACGACTGTGACAACTGAAGTGAATGGAAG 
SHSTLETTFDTTVTTEVNGR 

61 GGCCATCCCCAACCTGACAAGCCGACCTTCCCCCATGACCTGGAGACTGGGTCAAGCGTG 
AIPNLTSRPSPMTWRLGQAC 

121 CCCTCGTCTACAGGCTGGAGATGCCCCCTCCATGGGCGCTGGATATTCTCGAAGCGGTAC 
PRLQAGDAPSMGAGYSRSGT 

181 CAGCCGATTCATCCACACGGATCCCTCCAGGTTTATGTATACCACGCCTCTCCGCCGAGC 
SRFIHTDPSRFMYTTPLRRA 

241 TGCTGTCTCGCGTCTGGGAAAC ATGTCAC AAATAGATATGAGCGAGAAAGC AAGCAGTGA 

AVSRLGNMSQIDMSEKASSD 

3 01 CCTGGATGTGTCTTCTGAAGTGGATGTTGGTGGATACATGAGCGATGGTGATATCCTTGG 
LDVSSEVDVGGYMSDGDILG 

3 61 GAAGAGTCTGAGAGCGGATGATATCAACAGTGGGTACATGAGAGATGGTGGGCTCAACCT 
KSLRADDINSGYMTDGGLNL 

421 ATATACC AGAAGTCTTAACCGAGTCCCGGACAC AGC AACTTCC AGAGATGTCATAC AGAG 

YTRSLNRVPDTATSRDVIQR 

481 AGGCGTTCACGATGTGACAGTGGACGCAGACAGCTGGGATGACAGCAGTTCTGTGAGCAG 
GVH DVTVDADSWDDSSSVSS 

541 TGGCCTCAGTGACACACTTGATAACATTAGCACAGATGACCTCAACACCACGTCCTCCAT 
GLSDTLDNIS TDDLNTTS S I 

601 C AGTTCTTACTCC AAC ATC ACTGTCCCCTCCAGGAAGAACACTC AGCTGAAAAC AGATGC 

SSYSNITVP SRKNTQLKTDA 

6 61 GGAGAAACGTTCGACAACAGATGAGACCTGGGATAGTCCTGAGGAGCTGAAGAAAGCCGA 

EKRSTTDETWDSPEELKKAE 

721 GGGAGATTGTGACAGCCATGGTGACGGAGCCGCCAAGTGGAAGGGTGCTACTTCTGGACT 
GDCDSHGDGAAKWKGATSGL 

7 81 TGC TG AAG AC TC GGAG AAG AC AGGG C AG AAAGC C AG C CTGTC TG TG TC TC AG AC AGGC TC 

AEDSEKTGQKASLSVSQTGS 

841 CTGGAGGAGAGGCATGTCTGCCCAGGGAGGAACTCCAGCTACAGCTAGGCAGAAAACCAG 
WRRGMSAQGGTPATARQKTS 

9 01 CACAAGTGCACTCAAGACCCCTGGGAAGACAGATGATGCCAAAGCTTCCGAGAAAGGGAA 
TSALKTPGKTDDAKASEKGK 

9 61 AACTCCTCTCAAAGGATCATCCTTGC AAAGGTCTCCTTC AGATGC AGGG AAAAGC AG CGG_ 

'TPLKGSSLQRSPS DAGKS S G"" 

1021 GGATGAAGGGAAAAAGCCACCGTC AGGCATTGGAAGATCGACAGCC AGC AGTTCTTTTGG 
DEGKKPPSGIGRSTASSSFG 

1081 ATACAAGAAGCCAAGTGGTGTAGGGGCTTCCACTATGATTACCAGCAGCGGTGCCACCAT 
YKKPSGVGASTMITSSGATI 

1141 CACAAGCGGTTCAGCTACACTGGGG AAAATCCCCAAATCCGCTGCCATTGGTGGGAAGTC 
TS. GSATLGKI PKSAAIGGKS 

12 01 CAATGCAGGAAGGAAAACCAGCCTGGACGGGTCCCAGAATCAAGATGATGTTGTCCTGCA 
NAGRKTSLDG SQNQDDVVLH 

12 61 CGTGAGCTCGAAGACCACCCTCCAGTACCGTAGTTTGCCCCGCCCTTCTAAGTCCAGCAC 
VS SKTTLQYRSLPRPSKSST 

1321 CAGCGGAATCCCTGGGAGAGGTGGCCACAGGTCGAGC ACCAGCAGC ATTGATTCC AATGT 
SGIPGRGGHRSSTSSIDSNV 



Fig. 9 
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13 81 CAGC AGC AAGTCAGCTGGGGCCACC ACCTCCAAACTG AGAGAACCGACTAAG ATCGGCTC 
SSKSAGATTSKLREPTKIGS 

1441 AGGGCGCTCGAGTCCAGTCACTGTCAACCAAACAGACAAAGAGAAGGAGAAAGTAGCAGT 
GRSSPVTVNQTDKEKEKVAV 

1501 GTCAGATTCAGAGAGCGTTTCCTTGTCAGGTTCCCCCAAATCCAGCCCCACCTCTGCCAG 
SDSESVSLSGSPKSSPTSAS 

1561 TGCCTGTGGGACTCAAGGGCTCAGACAGCGAGGGTCC AAATATCCAGATATTGCCTCGCC 
ACGTQGLRQPGSKYPDIAS'P 

1621 CACATTTCGAAGGTTGTTCGGTGCCAAGGCAGGCGGCAAATCTGCCTCCGCACCTAATAC 
TFRRLFGAKAGGKSASA PNT 

1681 TGAGGGGGCGAAGTCCTCCTCAGTAGTGCTCAGCCCTAGTACCTCTTTAGCCCGACAAGG 
EGAKSS SVVLSPSTSLARQG 

1741 CAGTCTGGAGTCACCGTCGTCCGGTACGGGAAGCATGGGCAGTGCTGGTGGGCTGAGTGG 
SLESPSSGTGSMGSAGGLSG 

18 01 CAGCAGCAGCCCTCTCTTC AATAAACCCTC AGACCTAACTACAGATGTTATAAGCTTAAG 
SSSPLFNKPSDLTTDVISLS 

18 61 TCACTCCTTGGCTTCCAGCCCAGCGTCGGTTCACTCTTTCACATCCGGTGGGCTTGTGTG 

HSLASSPASVHSFTSGGLVW 

1921 GGCTGCCAATCTGAGCAGTTCCTCTGCCGGCAGCAAGGACACTCCAAGTTACCAGTCCAT 
A A N" LSSSSAGSKDTPSYQSM 

19 81 GACTAGTCTCCATACGAGCTCTGAGTCCATTGACCTGCCCCTCAGCCATCATGGCTCCCT 

TSLHTSSESIDLPLSHHGSL 

2 041 GTCTGGACTGACCACAGGCACTCACGAGGTGCAGAGCCTGCTCATGAGAACGGGTAGTGT 
SGLTTGTHEVQSLLMRTGS.V 

2101 GAGATCTACTCTCTCAGAAAGATACACCCCATCATCTCGGCAGGCCAACCAAGAAGAAGG 
RSTLSERYTPSSRQANQEEG 

2161 CAAAG AGTGGCTGCGATCGC ATTCC ACTGGCGGGCTGC AGGATACTGGCAACC AGTCTCC 
KEWLRSHSTGGLQDTGNQSP 

2221 CTTGGTCTCCCCTTCTGCCATGTCATCGTCAGCCACCGGAAAATATCACTTTTCCAACTT 
LVSPSAMSSSATGKYHFSNL 

22 81 GGTGAGTCCCACCAACCTCTCCCAGTTTAACCTGCCTGCACCCAGTATGATGCGCTCCAG 

VSPTNLSQFNLPAPSMMRSS 

23 41 CAGTATCCCCGCCCAGGACTCCTCCTTCGACCTCTATGATGATGCCCAGCTTTGCGGTAG 

SIPAQDSSFDLYDDAQLCGS 

2401 -TGCAACTTCCCTGGAGGAAAGGCCACGGGCCGTTAGCCACTCCGGCTCATTCAGAGACAG-" 
A.TSLEERPRAVSHSGSFRDS 

2 4 61 CATGGAGGAAGTTCATGGCTCTTCACTGTCATTGGTCTCCAGCACATCATCCCTTTACTC 
MEEVHGSSLSLVSSTSSLYS 

2521 TACGGCTGAAGAG AAGGCTCATTCAGAGCAAATCCATAAGCTACGGAGAGAACTGGTTGC 
TAEEKAHSEQIHKLRRELVA 

2 581 CTCCCAGGAGAAAGTCGCTACCCTCACGTCTCAGCTGTC AGC AAATGCTCACCTTGTAGC 
SQEKVATLTSQLSANAHLVA 

2 641 AGCTTTTGAAAAGAGTTTAGGGAATATGACTGGCCGTTTGCAAAGTCTAACCATGAC AGC 
AFEKSLGNMTGRLQSLTMTA 

27 01 GGAACAAAAGGAATCTGAGCTTATCGAACTGCGGGAAACC ATTGAAATGTTGAAGGCCCA 
E ._Q 'K E S E L I ELRETI EMLKAQ 
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27 61 GAACTCTGCTGCCCAAGCAGCCATTCAGGGAGCACTGAATGGCCCAGACCACCCTCCCAA 
NSAAQAAIQGALNGPDHPPK 

2821 AGATCTCCGC ATCAGAAGACAGC ACTCCTCTGAAAGTGTTTCTAGTATC AAC AGCGCAAC 
DLRIRRQHSSESVSSINSAT 

2881 GAGCCATTCCAGCATTGGCAGTGGTAATGATGCTGACTCCAAGAAA 
SHSSIGSGNDADSKK 
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Murine genomic sequence 



1 


GGGATGAAGG 


GAAAAAGCCA 


CCGTCAGGCA 


TTGGAAGATC 


GACAGCCAGC 


51 


AGTTCTTTTG 


GATACAAGAA 


GCCAAGTGGT 


GTAGGGGCTT 


CCACTATGAT 


101 


TACCAGCAGC 


GGTGCCACCA 


TCACAAGCGG 


TTCAGCTACA 


CTGGGGAAAA 


151 


TCCCCAAATC 


CGCTGCCATT 


GGTGGGAAGT 


CCAATGCAGG 


AAGGAAAACC 


201 


AGCCTGGACG 


GGTCCCAGAA 


TCAAGATGAT 


GTTGTCCTGC 


ACGTGAGCTC 


251 


GAAGACCACC 


CTCCAGTACC 


GTAGTTTGCC 


CCGCCCTTCT 


AAGTPPAPPA 


301 


C C AG CGGAAT 


CCCTGGGAGA 


GGTGGCCACA 


GG TCG AGC AC 


PAPPAPPATT 






TPAGPAPPAA 


PTP A PPTPPP 


PPPAPPAPPT 


PP A AAPTP&P 


Am 


apaapppapt 


AAGATPPGPT 


PAPPPPPPTP 


PAPTPPAPTP 

VJJ.tt.VjJ x V— V_tt.VJJ X V, 


APTPTP Z. a PP 


Am 

fi Ji 


A AAPAPAPA A 


A P A P A A PP A P 




TPTP A P & TTP 


AVXrt\jXrt\j\_.VJ 1 X 


DUl 




nTTPPPPP A a 






vj 1 x vjj 1 (jKjx 


J D -L 


p a PTP A a ppp 


PTP APAPAPP 


P A P f^HTPP A 7x 


2i T Zi. TPP 2i P A T 


Al J. Vjv-L- J. CoL- 


DUX 


p p a r* a tttp p 

\~ v_.tt.\ rt. i l l 




TATfiT'AAAnA 


P A TP ttppp a 

VJtt X \-3 X i VjJv.3VJJ.tt. 




U Ji 


VjjVjj X rtVjj X <rv X X 


A PP A TP T A TT 

X w X X X 


TATTPTPTAP 

X A X i v_- X \JJ i nL 


PA A APTATTT 
VjJtt-ttvttV_. X A XXX 


1 v_.tt. ±\3 ifii 1L 


701 


TiAATATTfT 

X X .TT X ± ^_ X 


AAPATTPTPT 


ATPTTATAPT 

XI — XX X tt.V — X 


TPTPT A A A AT 

i \JJ X V_ i ^ltt_tt-tt x 


ATAPTP2.TTT 


751 


T A TTTTPPTP 

X 4± X X X X u\.f X v_J 


ATTGCACCTG 


TTGCTAGTGT 

X X \J X xx\J X u X 


AAAAG P ATTP 


PTPATTTAPA 
v_- X V_/\ XXX -tt.vjr.tt. 


801 


GAGTGGTTAG 


C CTTTC AG CT 


ATACAGCCAG 


TGTGACACTA 


AAATAPAPAT 

A/lri X tt.V — tt. vjltt. X 


851 


ACC AC TTGT A 


GCGGGCATAA 


AACCACATGA 


CTG AC T ATTC 

\f X wnSv X X X 


ATAPA A ATA A 


901 


AGTG ATAGC T 


TGTAAAGATA 


TTTAGTGATT 


TC C AC CTC TC 


PTTTPPAPAA 


951 


TTAAAAAAAG 


CAAATTGCAT 


AGATCTTTAT 


AAACACATTT 


APTTPTAPTP 


1001 


T ATG TT ATC T 


TG T TG ACTC T 


TAATGAAATG 


GCAGTTATGA 


ATATAPATPA 


1051 


TATATTC TTT 


CTAACAGTTT 


ATAAGAGACC 


AATTTATACA 


PTAPPAPATP 


1101 


TTAACATAGT 


AACAATAACA 


GCAACAAAAA 


C AACC C AAAA 


AGC T ATP AAA 


1151 


GTATGGTCTG 


ATTGCAGAAT 


TTGAAAACAT 


TT AC ATG TTT 


PAPATAPPAP 


1201 


AAGAACTCAG 


GAGTGAGGTG 


ACTTTTTATA 


AGTCTTCATC 


A ATPTPPTTT 

Ail J. VjT 1 ^.V 1 X X 


1251' 


TACAGGAACC 


AGGAAGCATA 


TCTGATATAT 


GTGTCAGGAT 


TATCACTTTA 

x *i x • xxx 


1301 


TT AATTATG T 


GAAATTCTGT 


TTAGAAATCT 


AC C TG ATTTT 


AAATACTTTA 


1351 


ATATAGTAGG 


GGTCAAAATT 


AGTTAATGAG 


TTAAGACAAG 


TTGTTAAATA 


1401 


ATCCTGGCTC 


TGTTTTCTCA 


TCTTCAAAAT 


GATAGAGTAT 


AATTTATCAC 


1451 


CTCTTGTTAA 


ATATTTCAGG 


TTTGTGTTTA 


TTCTCTTGAT 


AACTTTGATC 


1501 


TCTTAGAAGA 


GTCTTGAAGA 


ATTTACATTA 


AGTAATCTTA 


GAAACATAAC 


1551 


TATTTGAGAA 


ACAGTAGTCA 


AATTTTGTCA 


TTAGAAGTAT 


TAACTCTGAA 


1601 


GAATGATTTG 


AAGTGACAGT 


TCTTAGAAAG 


AATAAATTAT 


AGCTTGTAGC 


1651 


AAGAGTAAAT 


ATTTTCACTG 


CTTGTGTGAG 


AGCCAAGAGC 


GCCCTCTTGT 


1701 


GGCCCATTAC 


CTATGAAACA 


ATTTCTCATA 


TTCGCCCTAG 


AAATCTTCCA 



Fig. 10 
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1751 


CTGCAGGAAA 


TAATGGATTT 


CATTGCCTCT 


GAATTAGTAA 


CCATTCTGCC 


1801 


ATTTCTTCAT 


ACCATTTTAT 


TTCCATACTT 


G C ATAAATTT 


GATTATGTCA 


1851 


TCTGCTTCAT 


TTACAAAACT 


AAAATGTTTT 


CTGAGCTAAA 


CTCCAGTAGC 


1901 


TAACTTAGTA 


C AAATGG TAT 


TTTTAAATCA 


CTGCTATAAG 


TATATATATT 


1951 


TGAATAGCTC 


TGGCAACGGA 


CGGAAATCCC 


TATGGTCTTT 


CCATGGGAAG 


2001 


ATACAAACCA 


ATCCATAAGT 


TGTCCAGCAA 


TATCCAATAT 


TTCCAGCCCA 


2051 


GCCAGTCAGG 


CCTCTTAAAC 


ATTACCTTAC 


ATATTTGAAC 


CTTTCCTTAA 


2101 


ATGTCCCCTT 


TAGACAATCT 


ATTTTTTAAA 


AAGATGAAAA 


TCCATTTAAG 


2151 


CATCATATAT 


CGAATGCGTA 


GAAGTTGTTT 


CATTATAATG 


GTTCTGCAGA 


2201 


TAGGTAATGC 


CAAAACGGCC 


AAAATATTTG 


ATCACTAGAA 


GCGTAAAAGT 


2251 


CAAGTACAAT 


CATGTTGACT 


TTTTTTCCAA 


GGTGGGTTCA 


CTGCTGCCCA 


2301 


CCTTGGTTCC 


AGGCCAGTGC 


TTACTTAAGA 


TATCGTAAGT 


GATTTTTTTT 


2351 


TAATTTTTAA 


TTTTTTAGTA 


GTTGGTTAAT 


CAAAAGCCAG 


TCATGTCACC 


2401 


TTCAGGAACA 


TAGAGGCTGG 


ACGTGCTTGG 


CAGCTCACGA 


CTCCAAAGCA 


2451 


CACTTGGCTC 


TGTGGACTGA 


AACCCTAGGA 


AACGTGGATG 


TGAGTCTCTT 


2501 


GGAACAACTC 


AAGTTGTTAT 


TTGTTTTTCT 


TTTAGGTTGT 


TCGGTGCCAA 


2551 


GGCAGGCGGC 


AAATCTGCCT 


CCGCACCTAA 


TAC 










Fig. 10 


(cont'd) 
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T2HC 



Homologous human cDNA 



1 

61 

121 

181 

241 

301 

361 

421 

481 

541 

601 

661 

721 

781 

841 

901 

961 

1021 

1081 

1141 



GGATCAGCTTCGGGAGACCATGCACAACATGCAGTTGGAGGTGGACCTGCTGAAAGCAGA 
DQLRETMHNMQLEVDLLKAE 

GAATGACCGACTGAAGGTAGCCCCAGGCCCCTCATCAGGCTCCACTCCAGGGCAGGTCCC 
NDRLKVAPGPSSGSTPGQVP 

TGGATCATCTGCATTATCTTCCCCACGCCGCTCCCTAGGCCTGGCACTCACCCATTCCTT 
GSSALSSPRRSLGLALTHSF 

CGGCCCCAGTCTTGCAGACACAGACCTGTCACCCATGGATGGCATCAGTACTTGTGGTCC 
GPSLADTDLSPMDGI STCGP 

AAAGGAGGAAGTGACCCTCCGGGTGGTGGTGAGGATGCCCCCGCAGCACATCATCAAAGG 
KEEVTLRVVVRMPPQHI IKG 

GGACTTGAAGCAGCAGGAATTCTTCCTGGGCTGTAGCAAGGTCAGTGGAAAAGTTGACTG 
DLKQQEFFLGCSKVS GKVDW 

G AAGATGCTGGATGAAGCTGTTTTC CAAGTGTTC AAGGACTATATTTCTAAAATGGAC C C 
KMLDEAVFQVFKDYI SKMDP 

AGCCTCTACCCTGGGACTAAGCACTGAGTCCATCCATGGCTACAGCATCAGCCACGTGAA 
AST LGL S TE S I HGY S I SHVK 

ACGAGTGTTGGATGCAGAGCCCCCCGAGATGCCTCCTTGCCGTCGAGGTGTCAATAACAT 
RVLDAEPPEMPPCRRGVNNI 

ATCAGTCTCCCTCAAAGGTCTGAAGGAGAAATGCGTCGACAGCCTGGTGTTCGAGACGCT 
SVSLKGLKEKCVDSLVFETL 

GATCCCCAAGCCGATGATGCAGCACTACATAAGCCTCCTGCTGAAGCACCGGCGCCTCGT 
IPKPMMQHYISLLLKHRRLV 

CCTCTCGGGCCCCAGCGGCACGGGCAAGACCTACCTGACCAATCGCTTGGCCGAGTACCT 
LSGPSGTGKTYLTNRLAEYL 

GGTGGAGCGCTCTGGCCGTGAGGTCACAGAGGGCATCGTCAGCACCTTCAACATGCACCA 
VERSGREVTEGIVSTFNMHQ 

GCAGTCTTGCAAGGATCTGCAACTGTATCTTTCCAACCTAGCCAACCAGATAGACCGGGA 
QSCKDLQLYLSNLANQIDRE 

AACAGGAATTGGGGATGTGCCCCTGGTGATTCTATTGGATGACCTGAGTGAAGCAGGCTC 
TGIGDVPLVILLDDLSEAGS 

CATCAGTGAGTTGGTCAATGGGGCCCTCACCTGCAAGTATCATAAATGTCCCTATATTAT 
I SELVNGALTCKYHKC PYI I 

AGGTACCACCAATCAGCCTGTAAAAATGACACCCAACCATGGCTTGCACTTGAGCTTCAG 
GTTNQPVKMTPNHGLHLSFR 

GATGTTGACCTTCTCCAACAACGTGGAGCCAGCCAATGGCTTCCTGGTTCGTTACCTGAG 
MLTF SNNVE PANGFLVRYLR 

GAGGAAGCTGGTAGAGTCAGACAGCGACATCAATGCCAACAAGGAAGAGCTGCTTCGGGT 
RKLVESDSDINANKEELLRV 

GCTCGACTGGGTACCCAAGCTGTGGTATCATCTCCACACCTTCCTTGAGAAGCACAGCAC 
L DWV P K L W Y H L H T F L E K H S^T 



Fig. 11 
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CTCAGACTTCCTCATCGGCCCTTGCTTCTTTCTGTCGTGTCCCATTGGCATTGAGGACTT 
SDFLIGPCFFLSCPIGIEDF 

C CGGAC CTGGTTC ATTG ACC TGTGG AAC AACTCT ATC ATTCC CTATCTAC AGGAAGG AGC 
RTWF IDLWNNS I IPYLQEGA 



1321 C AAGGATGGGATAAAGGTCC ATGGACAGAAAGCTGCTTGGGAGGACCCAGTGGAATGGGT 

KDGIKVHGQKAAWEDPVEWV 

13 81 ■ - CCGGGACACACTTCCCTGGCCATCAGCCCAACAAGACCAATCAAAGCTGTACCACCTGCC 
RDTLPWPSAQQDQSKLYHLP 

1441 CCCACCCACCGTGGGCCCTCACAGCATTGCCTCACCTCCCGAGGATAGGACAGTCAAAGA 
PPTVGPHSIASPPEDRTVKD 



1501 CAGCACCCCAAGTTCTCTGGACTCAGATCCTCTGATGGCCATGCTGCTGAAACTTCAAGA 
STPSSLDSDPLMAMLLKLQE 

1561 AGCTGCCAACTACATTGAGTCTCCAGATCGAGAAACC ATCCTGGACCCCAACCTTC AGGC 

AANYIESPDRETILDPNLQA 

1621 AACACTTTAAGGGTTCGGCAATCACTGTCACCCCCGGACAGCAGAACGCTGGCATCAGCT 
T L * 

16 81 ATCTTAGCTCCTCCTCTCCCCTCTCCTCTTTCAGAGCACTGGCTCTCCAGCCCCAGGAGG 

1741 AGAACAGGAGGGAGGAGGAGATGAAAGAGGAGGGACAGGTTCTTGGTGCTGTACCTTTGA 

1801 GAACTTCCTAGGAAGGAATGGTGGGGTGGCGTTTGGGAACTTGTGCCCCCTAAACACATT 

1861 TACTGGCCTCCTCTAATGACTTTGGGGAAAAGATGATTCTGGGTCTTTCCCTTGACTTCT 

1921 TGTTTCAATTACAAACTCCTGGGCTTTCTGGGGAGGGGTTCAGAAAACATCAAAACACTG 

19 81 CAGCAGTTCCTAAATGATTCTCACAAGCAACCCTGAGAGAGACAGTCTTGTGAGGGAGAT 

2041 CTGGGGGAGGCAGGAAGCTCCTCAGATTTTCTCACAGACCCTTCCCAATTCCATCACCAC 

2101 TGCCAACAACTCCTCCCCCAGAGATCTGGCTGGAGCCCAGAAAAAGAAGCATGTGGTTTA 

2161 AAAAATGTTTAAATCAATCTGTAAAAGGTAAAAATGAAAAACAAAAACAAGCAAACAAAC 

2221 AAAAAACAATGGAAAAGATGAAGCTGGAGAGAGAGGAACCAGTTGCCAAGGTAGAGAGCT 

22 81 GCCCGCTCCTGCCCTCTGGATGACATAGGGGACATCAACAAGACGGCTGCCAACCTGAGA 

2341 AGTCACCAAACCACAAAAATAACCTTACAGCCTTCAGGGAAAGACTACCAGCTCTGTCTT 

2401 TCTACCCTCTAATTTAACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAAAA 



Fig. 11 (cont'd) 
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Homologous murine cDNA sequence 



1 GAACTATGGGAAAAAGAGATGAAGCTCACGGATATCCGGTTGGAGGCCCTCAACTCTGCC 
ELWEKEMKLTDIRLEALNSA 

61 CACCAGCTGGACCAGCTTCGGGAGACCATGCACAATATGCAGTTGGAGGTGGACCTGCTG 
HQLDQLRETMHNMQLEVDLL 

121 AAAGCAGAGAATGACCGGCTGAAGGTTGCCCCCGGCCCCTCCTC AGGCTGCACTCCAGGG 
KAENDRLKVAPGPSSGCT PG 

181 CAGGTCCCTGGGTCATCGGCTCTGTCGTCCCCTCGACGTTCCCTGGGCCTTGCACTCAGC 
QVPG S SAL S S PRRSLG LAL S 

241 CATCCTTTCAGTCCTAGTCTCAC AGACACAGACCTCTC ACCCATGGATGGC ATCAGCACC 
HPFS P SLTDTDLSPMDGI ST 

301 TGTGGTTCAAAGGAAGAGGTGACCCTGCGGGTGGTGGTCCGGATGCCGCCCCAGCACATC 
CGSKEEVTLRVVVRMPPQHI 

3 61 ATCAAAGGGGACTTAAAGCAGCAGGAGTTCTTCCTGGGTTGCAGCAAGGTCAGTGGCAAA 
IKGDLKQQEFFLGCSKVSGK 

421 GTTGACTGGAAGATGCTGGATGAAGCCGTTTTCC AAGTGTTC AAGGACTAC ATTTCTAAA 
VDWKMLDEAVFQVFKDYISK 

481 ATGGACCCAGCCTCAACCCTGGGACTGAGCACTGAGTCCATACATGGCTATAGCCTCAGC 
MDP.ASTLGLSTESIHGYSLS 

541 CACGTGAAACGAGTGCTGGATGCTGAGCCCCCAGAGATGCCTCCTTGCCGCCGAGGTGTC 
HVKRVLDAEP PEMPPCRRGV 

601 AATAACATATC AGTCGCTCTCAAAGGTCTGAAAGAGAAGTGTGTCGAC AGCCTGGTGTTC 
NNI SVALKGLKEKCVDSLVF 

661 GAGACGCTTATCCCCAAGCCCATGATGCAGCACTACATCAGCCTCCTGCTCAAGCACCGG 
ETLI PKPMMQHYISLLLKHR 

721 CGCCTGGTGCTCTCCGGCCCCAGTGGCACCGGCAAGACCTACTTGACCAATCGGCTAGCC 
RLVL SGPSGTGKTYL .TNRLA 

7 81 GAGTACCTGGTGGAGCGCTCCGGCCGCGAGGTCACGGATGGCATCGTCAGC ACTTTCAAC 
EYLVERSGREVTDGIVSTFN 

841 ATGCACCAGCAGTCTTGCAAGGATCTGCAACTGTACCTCTCCAACCTAGCCAACCAGATA 
MHQQ SCKDLQLYLSNLANQ I 

901 GACCGGGAAACAGGGATAGGGGATGTGCCCTTGGTGATCCTCCTGGATGATCTGAGTGAA 
DRETGIGDVPLVILLDDLSE 

961 GCAGGCTCCATCAGTGAGCTGGTCAATGGGGCCCTCACCTGCAAGTATCACAAATGTCCC 
AGS I SELVNGALTCKYHKC P " 

1021 TACATTATAGGTACCACCAATCAGCCTGTAAAAATGACACCCAACCATGGCTTGC ACTTG 
YIIGTTNQPVKMTPNHGLHL 

1081 AGCTTCAGGATGCTGACCTTCTCGAACAATGTGGAACCAGCCAATGGCTTTCTGGTCCGT 
SFRMLTFSNNVEPANGFLVR 

1141 TACCTGCGGAGGAAGTTGGTAGAGTCAGACAGTGACGTCAATGCTAACAAGGAAGAGCTG 
YLRRKLVESDSDVNANKEEL 

12 01 CTTCGGGTGCTGGACTGGGTGCCC AAGCTGTGGTATC ACCTCC AC ACCTTCCTGG AG AAG 

LRVLDWVPKLWYHLHTFLEK 

1261 CACAGCACCTCGGACTTCCTCATTGGCCCTTGCTTCTTCCTGTCCTGTCCCATTGGCATC 
HSTSDFLIGPCF.FLSCPIGI 

13 21 GAGGACTTCCGGACCTGGTTCATTGACCTGTGGAACAATTCCATCATCCCCTATCTACAG 3 — 

EDFRTWFIDLWNNSI IPYLQ 



Fig. 12 
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13 81 GAAGGAGCCAAGGATGGGATCAAGGTTCATGGACAGAAAGCTGCTTGGGAAGACCCGGTG 
EGAKDGIKVHGQKAAWEDPV 

1441 GAATGGGTCCGAGACACTCTTCCCTGGCCGTCGGCCCAACAAG ACC AATC AAAGCTCTAC 
EWVRDTLPWPSAQQDQSKLY 

1501 C ACCTGCCCCCGCCTTCTGTGGGCCCCC AC AGCACTGCCTCACCCCCGGAGGACAGGACA 
HLPPPSVGPHSTASPPEDRT 

1561- GTCAAAGACAGCACTCCAAACTCCCTCGACTCAGATCCCCTGATGGCCATGCTACTGAAA 
VKDSTPNSLDSDPLMAMLLK 

1621 CTCCAAGAAGCTGCCAACTAC ATTGAGTCACCAGATCGAGAGACTATCCTGGACCCCAAC 
LQEAANYIESPDRETILDPN 

1681 CTQCAGGCGACACTCTGAGGGCCCGGCAGTCACTGTCACCCTGGAGGGCAGAAGGCTGGC 
L Q A T L * 

1741 TTCAGCATCATTAGCTCTCCTCTGCCCTCTTCCTTCATAGCTCTGGCTCACC AGCCTCGC 

1801 CAAGAGAACAGGAGGGAAGAAGAGGGCAGGAGGAGGGATGGGTTCTCGGTGCTGAACCTT 

1861 TGAGAACTTCCTACTAGGAATTGGAGGGGGTGGAGTTTGAGAACTCCGTGCCCCTTAACT 

1921 ACATTTGCTGGCCTCCTCTTACGACTTAGGAGAAAAGATGATTCTGGTCTTTTCTTC AAG 

19 81 TTTTGTTTCACCTAC AAACTCTTGGGCTTTCTGGGGAGGGATTCGGAAGATATAAAC AGA 

2 041 CAAACAAAAACAAACAAACCAACTAC AGCAGTTCCAAGCTCGTTCTC ACAAAC ACCTCTG 

2101 AGACAGTCAC ATGTGGGCAAATCTAAGGGAGGC AGGAAGCTCTACAGACTTTCTTGCAAA 

2161 CCCTTCCCAGTTCTGTCGACACTGCC AACAACCTCCCCGCCAGAGACCTGGCCAGAGCCA 

2221 AGAAAAGAGAAGCATGTGGTTTAACAGAAAAACAAAACAAAACAAAACAAAAAATATATG 

2281 TGTAAATCAACCTGTAGAAGGTAAAAACGGCAATGGAAAAGATGAAGCTGGAAGGAGGGG 

2341 CCCAGTTGCCAAGATGGAACGAGAGCTGCCAGATCTTGCCTTCTGGATGACAAGAGGGGA 

2401 CATTGCAAGATGGCTGCC AGTCTAAAACGTC ACC AGACCAC AAGAGTAAC ATC AC AGCCT 

2461 TCGAAGAAAGGCCACAAGCTGTCTTTCTGCCCTCTAACTGAACATGCATG AAAAGTCAAT 

2521 AAACCCTACTTTTTAATTTTTAAAAAAAAAAAAAAAAAAAAAAAAAAA 



Fig. 12 (cont'd) 
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T2 Murine cDNA with following intron 



CCAATAGAACTCCGGATCAAGAGGCAGAATTCCTCAGATAGCATCTCCAGCCTCAACAGC 
X + + + + + + 60 

PIELRIKRQNSSDSISSLNS 

ATC AC CAG C C ATTC CAGCATCGGCAGCAGC AAAG ATGCTG ATG CCAAGAAGAAAAAG AAG 
61 + + + + + + X2Q 

ITSHSSIGSS KDADAKKKKK 

AAGAGTTGGGTATGTAAAGGCTTGGGGATCGGCCTGTGCTAGGAGTCACTCACCCTGTTG 

121 + + + + + + 180 

K S W 

CAGGG AACTGAC C C CTTTCAGGATC AAC AAAGAGGGTC C CTTCTAAC AGGATG C CAGTGT 
181 + + + + + + 240 

TGTGACATCTGCTGGGGACAAAAATTCACTAAGTTCCCATTCCTCTATCCATTGTCTATT 
241 + + + + + + 300 

CTCCTTACCACCGCCCTGCACATATACCCCAGCCCCCCACCGTCCCTGCATCCTTTATAC 
301 + + + + + + 360 

ATGTCTGCTATCCTGGGGCTCTACCTACTGATGAGGTCAAATGTATTTGGCCGTAGAAGG 
361 + + + + + + 420 

AGCTGAGAAAATTATTCATGGGTGGGAGAGTGGGGCATGTGGAGAGAATTTGTAAGCCAA 
421 + + + + + + 480 

GCAGGGTACTCTAGACGCTCCTGGGGCTGTTGCTTTAGTTTGGGTGAGGAGGCTGTGGAA 
4 81 + + + + + -■• + 540 

CGTCCCCATCGCTCCAAAGCCTGCTTTTGTCTGGTCCAGAGGTGGGTTTGTTCTGTGTGG 
541 + + + + + + 600 

TATCCCCCCTGTAACTCTAAACTGGCTTTGGGTGAGCTTTCTACAATCTGTACGCAGGTG 
601 + + + + + + 660 

TAGGGCACTGCCTGACTGACTGAAAGGGAGAGTGACCCAGAGTGAGCGTCTTGTCCCTGT 
661 + + + + + + 720 

CCCTGCTGAGGAGGGCTGGCTACAGACTTTGGCCTAGTGCAGACAGGAGCCAGCTGTGTG 
721 + + + . + + + 780 

GAGAAGCAGCTGTGTGAAATGCATGAGTAGTGTCGCTGCTGCTGCTGCTGCTGCTTTCTT 
781 + + + + + + 8 40 

TTCATTGTTTTTTTTTTTTTTTCTTTCCTTTTATTTCCTTCAAAATGCTGACCTCAAATC 



901 + + + + + + 960 

TTCCCTTTGCACACCTTCGTGTTTCCAGAGCAAGCACCTCTCTTAAAAAAAAAAAAAAAA 
961 + + + + + + 1020 

AAAAA I 
1021 1025 - " 

Fig. 13 
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splicing variant x (JFC4l0) 

1 

61 

121 GTCAAAAAAAGCCTACT, 



TC C AG TC C fT C^^ 



•AGGCTTCCAGGGCCCTCTAGGGTGCCTGCTQCAGGAAGCAGCA 
RLPGPSRVP AAGSSS 



241 AA 

bp 1 corresponds to bp 914 of THC 

underlined sequence represents further splicing form and is not 
shown in the THC sequence 



Fig. 14 
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splicing variant 2 



1 GGCACTCACGAGGTCCAGAGCCTGCTCATGAGAACGGGTAGTGTGAGATCTACTCTCTCA 
GTHEVQSLLMRTGSVRSTLS 

6 1 GAAAGATATACCCCATCATCTCGGCAGGCCAACCAAGAAGAGGGCAAAGAGTGGTTGCGT 
ERYTPSSRQANQEEGKEWLR 

12 1 TCTCATTCTACTGGAGGGCTTCAGGACACTGGCAACCAG 
S "'"H STGGLQDTGNQ 



bp 1 corresponds to bp 3300 of THC 

underlined base pairs -> position of the differentially spliced exon which 
lacks here but is shown in the THC sequence 



Fig. 15 
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T2-cDNA sequence and T2 protein encoded therein 



CCGCGGGGCTTCCATCCTTCCTTTGACTGATTTTTAAATTTTAATTTGTATTTTCCCCGC 
1 + + + + + + 60 

RGAS ILPLTDF* ILICIFPA- 

CGCCCCGCCCCTTTTCCTCCGACCCCGCCCTATCGCTCCCCGGCTTCCCTGCTCTTTCCT 

6! + + + + + + 120 

APPLFLRPRPIAPRLPCSFL- 

TTTTCCCGGCTTCCTTCCTCGCGTTTCTTTCCCCTGCGCCCTCGGCTTGCCTCTCTCCCT 

121 ;_- + + + + --. + -+ 180 

FPGFLPRVSFPCALGLPLSL- 

CCTCCCTCGCTCTCTCCCCCTTCTCTCCCCTTCTTCCTCGGTTTCTTCCGTCCTCTCTCT 

181 + + + + + + 240 

LPRSLPLLSPSSSVSSVLSL- 

CCCCCTCCTCCTCCCCCGCCTCCTCCTCCTGCGCTCCCGCCCCCTGCCCCCTCCCCCCGT 

241 + + + + + + 300 

PLLLPRLLLLRSRPLPPPPV- 

GCCTGCAGACGCGCGGATCGTCCATGCGCTCCTCGCGGGCAGAATGCTGGGCAGCAGCGT 

301 + + + -f + + 360 

PADAR IVHALLAGRMLGS SV 

CAAGAGCGTGCAGCCCGAGGTGGAGCTGAGCAGCGGCGGCGGCGACGAGGGCGCGGACGA 

361 + + + + + + 420 

KSVQPEVELSSGGGDEGADE - 

ACCGCGGGGCGCCGGCAGGAAGGCGGCAGCGGCGGACGGCAGAGGCATGCTGCCCAAGCG 

421 + + + + + + 480 

PRGAGRKAAAADGRGML.PKR- 

CGCCAAGGCGCCCGGCGGCGGCGGCGGCATGGCCAAGGCCAGCGCGGCTGAGCTGAAGGT 

481 + + + + + + 540 

AKAPGGGGGMAKASAAELKV- 

CTTCAAGTCCGGCAGCGTGGACAGCCGTGTCCCCGGCGGGCCGCCCGCCTCCAACCTGCG 

541 + + + + + + 600 

FKSGSVDSRVPGGPP ASNLR 

CAAGCAGAAGTCACTCACCAACCTCTCTTTTCTCACGGACTCCGAGAAAAAGCTGCAGCT 

601 + + + + + + 660 

KQKSLTNIiSFLTDSEKKIjQIj- 

TTATGAGCCCGAATGGAGCGACGATATGGCCAAGGCGCCCAAAGGCTTAGGCAAGGTGGG 

661 + + + + + + 720 

YEPEWSDDMAKAPKGLGKVG 

GTCCAAGGGCCGTGAAGCTCCGCTGATGTCCAAGACGCTGTCCAAGTCGGAGCACTCGCT 

721 + + + + + + 780 

SKGREAPLMSKTLS KS EHSL 

CTTCCAGGCCAAGGGCAGCCCGGCGGGCGGCGCCAAGACCCCCCTGGCTCCGCTCTCGCC 

781 + + + + + 840 

FQAKGS PAGGAKTPLAPLiAP 



Fig. 16 
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CAACCTGGGAAAGCCGAGCCGGATCCCTCGAGGACCCTATGCGGAGGTCAAGCCGCTCAG 
+ + + + + + 

NLGKPSRI PRGPYAEVKPITS 

CAAGGCGCCTGAAGCGGCCGTGAGCGAAGATGGCAAATCGGACGACGAGCTGCTCTCCAG 
+ + + + + + 

KAPEAAVSEDGKSDDELLSS 

CAAGGCCAAGGCGCAAAAGAGCTCTGGGCCTGTCCCCTCTGCCAAGGGCCAGGAGGAGCG 
h + + + ^ + 

KAKAQKSSGPVPSAKGQEER 



CGCCTTCCTCAAGGTGGACCCCGAGCTGGTGGTGACCGTGCTGGGAGACCTGGAGCAGCT 

1021 + + + + + + 1080 

A^F LKVDPELVVTVLGDIjEQL - 

GCTCTTC AGC CAGATGCTGGACC CAGAGTC CC AGAG AAAGAGGACAGTGCAGAATGTCCT 

1081 + + + + + + 1140 

LFSQMLDPES QRKRTVQNVL - 

GGATCTCCGGCAGAACCTGGAAGAGACCATGTCCAGCCTGCGAGGGTCCCAGGTGACTCA 
1141 + + + + + + 1200 

DLRQNLEETMSSLRGSQVTH- 

CAGCTCCCTGGAGATGACCTGCTACGACAGCGATGATGCCAACCCACGCAGCGTGTCCAG 
1201 + + + + + + 1260 

SSLEMTCYDSDDANPRSVSS - 

CCTCTCCAACCGCTCGTACCCTCTGTCATGGCGCTATGGCCAGTCCAGTCCGCGGCTGCA 

1261 + + + + + -+ 1320 

LSNRSYPLSWRYGQSSPRLQ- 

GGCTGGTGACGCGCCCTCTGTGGGTGGGAGCTGCCGCTCGGAGGGGACGCCGGCCTGGTA 

1321 + + + + + -- + 1380 

AGDAPSVGGSCRSEGTPAWY- 

CATGCACGGCGAACGGGCCCACTACTCCCACACCATGCCCATGCGCAGCCCCAGCAAGCT 

1381 + + + + + + 1440 

MHGERAHYSHTMPMRSPSKL- 

CAGCCATATCTCCCGCCTGGAGCTGGTCGAATCCCTGGACTCGGATGAGGTGGACCTCAA 

1441 + + - + + + + 1500 

SHI SRLELVESLDSDEVDLK- 

GTCCGGCTACATGAGCGACAGTGACCTCATGGGCAAGACCATGACGGAGGATGATGACAT 

1501 + + + + + + 1560 

SGYMSDSDLMGKTMTEDDDI 

JCACTACCGGCTGGGATGAAAGCAGCTCCATCAGTAGTGGACTCAGCGATGCCTCAGACAA 

1561 + + + + + + 1620 

TTGWDESSSISSGLSDASDN- 

TCTCAGTTCAGAAGAATTCAATGCCAGCTCCTCACTCAACTCCCTCCCAAGTACTCCCAC 

1621 + + + + + + 1680 

LSSEEFNASSSLNSLPSTPT- 

TGCTTCTCGCAGGAACTCAACAATAGTGCTACGCACAGACTCAGAGAAGCGCTCACTGGC 
1681 + + + + + + 1740 

ASRRNSTIVLRTDSEKRS&A-- 



Fig. 16 (cont'd 1) 
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AGAAAGTGGGCTGAGCTGGTTTAGTGAATCAGAGGAGAAAGCCCCTAATiAAACTGGAGTA 
1741 + + + + + + 180Q 

ESGLSWFSESEEKAPKKLfi'Y- 

CGACAGTGGTAGCCTGAAGATGGAACCTGGGACTTCTAAGTGGCGGAGGGAGCGGCCTGA 

1801 + + + + ^ 

+ 1860 



DSGSLKMEPGTSKWRRER 



P E 



GAGCTGTGATGATTCATCCAAGGGTGGAGAACTGAAAAAGCCCATCAGCCTGGGCCACCC 
1 861 + + + + + + i92Q 

SCDDSSKGGELKKPISLGHP- 



TGGTTCCCTGAAGAAGGGCAAGACCCCACCTGTGGCTGTAACTTCCCCCATCACTCACAC 

+ + H + + + 

G SLKKGKTPPVAVTSPITHT 

AGCCCAGAGTGCCCTCAAAGTCGCAGGCAAACCTGAGGGCAAAGCTACAGACAAGGGTAA 
+ + + + h + 

AQSALKVAGKPEGKATDKGK 

GCTTGCAGTGAAGAATACTGGGCTCCAACGCTCCTCCTCTGATGCTGGTCGGGACCGCCT 

+ + + + + 

LAVKNTGLQRSSSDAGRDRL 
GAGTGATGCTAAGAAGCCCCCCTCGGGCATTGCTCGCCCCTCCACTTCGGGATCCTTTGG 

+ + h H + 

SDAKKPPSGIARPSTSGSFG 
CTACAAGAAGCCTCCTCCTGCCACAGGCACAGCCACTGTCATGCAAACTGGTGGTTCAGC 

+ + H h + h 

YKKPPPATGTATVMQTGGSA 



CACTCTCAGCAAGATCCAGAAGTCCTCAGGCATCCCTGTCAAGCCAGTAAATGGGCGCAA 

2221 + + + + + + 2280 

TLSKIQKSSGIPVKPVNGRK- 

GACTAGCTTAGATGTTTCCAACAGTGCAGAGCCAGGATTCCTGGCTCCTGGAGCCCGTTC 
2281 + + + + + + 2340 

TSLDVSNSAEPGFLAPGARS - 

TAACATCCAGTACCGCAGCCTGCCCCGGCCAGCCAAGTCAAGTTCTATGAGCGTGACCGG 
2341 + + + + + + 2400 

NIQYRSLPRPAKSSSMSVTG 

CGGGCGGGGTGGACCTCGCCCTGTGAGCAGCAGCATTGACCCCAGTCTCCTCAGCACCAA 
2401 + + + + + + 246Q 

GRGGPRPVSSSIDPSLLSTK- 

.GCAGGGAGGCCTTACGCCTTCCAGACTGAAGGAGCCTACCAAGGTAGCCAGTGGGCGGAC 
2461 + + + + + + 2520 

QGGLTPSRLKEPTKVASGRT- 

CACTCCAGCCCCTGTCAATCAGACAGATCGGGAAAAGGAGAAGGCCAAAGCCAAGGCAGT 

2521 + + + + + + 2580 

TPAPVNQTDREKEKAKAKAV- 

GGCCTTGGACTCAGACAACATCTCCTTGAAGAGTATTGGCTCCCCAGAAAGTACTCCCAA 

2581 + + + + + 2640 

ALDSDNIS LKS IGSPE ST^-K- 

Fig. 16 (cont'd 2) 
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GAACCAAGCAAGCCACCCCACAGCCACCAAGCTGGCAGAGCTGCCACCAACCCCTCTCAG 

2641 + + + + + + 2700 

NQAS HPTATKLAELP P TPL R 

GGCCACAGCGAAGAGCTTTGTCAAACCACCCTCACTAGCCAATCTTGACAAGGTCAACTC 

2701 + + + + + + 2760 

ATAKS FVKPPSLANLDKVNS 

CAACAGTCTGGATCTACCATCATCCAGTGATACCACCCATGCTTCAAAGGTCCCAGATCT 

2761 + + + + + + 2820 

NSLDLPSSSDTTHAS KVPDL 

GCATGCTACAAGCTCAGCATCTGGGGGCCCTCTCCCTTCCTGCTTCACCCCCAGTCCGGC 

2821 + + + + + + 2880 

H A TSSASGGPLPSC FTPSP^- 

ACCCATCCTCAATATTAACTCAGCCAGCTTCTCCCAGGGCCTGGAGCTAATGAGTGGTTT 

2881 + + + + + + 2940 

PILNINSASFSQGLELMSGF- 

CAGTGTGCCAAAAGAGACCCGCATGTACCCCAAACTCTCAGGCCTGCACAGGAGCATGGA 

2941 + + + + + + 3000 

SVPKETRMYPKLSGLHRSME 

GTCCCTCCAGATGCCAATGAGCCTCCCCAGTGCCTTCCCCAGCAGTACTCCCGTCCCCAC 

3001 + + + + + + 3060 

SLQMPMSLPSAFPSSTPVPT- 

CCCACCTGCTCCCCCTGCTGCTCCCACAGAAGAAGAGACGGAAGAGCTGACTTGGAGTGG 

3061 + + + + + + 3120 

PPAP PAAPTEEETEELT WSG 

AAGCCCCAGAGCTGGGCAACTGGACAGTAATCAGCGGGATCGGAACACTCTTCCCAAGAA 

3121 + + + + + + 3180 

SPRAGQLDSNQRDRNTLPKK- 

AGGGCTCAGGTACCAGCTTCAGTCCCAGGAGGAGACCAAGGAGAGGCGACATTCCCATAC 

3181 + + + + + + 3240 

GLRYQLQSQEETKERRHSHT- 

CATTGGTGGGCTGCCTGAATCCGATGACCAGTCAGAGCTGCCTTCTCCCCCTGCACTTCC 

3241 + + + : + + + 3300 

IGGLPESDDQSELPSPPALP- 

CATGTCTCTGAGTGCAAAGGGCCAACTTACCAACATAGTGAGTCCCACTGCGGCCACCAC 

3301 + + + ' + + + 3360 

MSLSAKGQLTNIVSPTAATT- 

"GCCAAGAATCACCCGCTCCAACAGCATCCCCACCCACGAGGCGGCCTTCGAGCTGTACAG 

3361 + + + + + + 3420 

PRITRSNS I PTHEAAFELYS 

CGGCTCCCAAATGGGGAGCACCCTGTCCCTGGCCGAGAGACCCAAGGGAATGATTCGGTC 

3421 + + + + + + 3480 

GSQMGSTLSLAERPKGMIRS 

AGGATCCTTCCGAGACCCCACGGACGATGTTCACGGCTCAGTGCTGTCCCTGGCCTCCAG 

3481 + + - + + + < + 3540 

GSFRD PTDDVHGSVLS L A ST ~S - 

Fig. 16 (cont'd 3) - _ 
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TGCCTCCTCCACCTACTCCTCAGCTGAGGAGAGGATGCAATCTGAGCAAATCCGGAAGCT 
3541 + + + + + + 3600 

ASSTYSSAEERMQSEQIRKL- 

TCGTAGGGAACTGGAATCATCCCAGGAAAAAGTGGCCACCTTGACGTCTCAGCTTTCTGC 
3601 + + + + + + 3660 

RRELESSQEKVATLTSQLSA- 

CAATGCTAATCTGGTGGCTGCTTTTGAGCAGAGCCTGGTGAATATGACATCCCGCCTGCG 

3661 + + + + + + 3720 

NANLVAAFEQS LVNMTS RLR 

ACACCTGGCAGAGACGGCCGAGGAGAAGGACACTGAGCTGCTGGATTTGCGAGAAACCAT 

3721 + + + + + + 3780 

H_LAETAEEKDTELLDLRETI 

AGACTTTCTGAAGAAAAAGAACTCTGAGGCCCAGGCAGTCATTCAGGGAGCCCTTAATGC 

3781 + + + + + + 3840 

DFLKKKNSEAQAVIQGALNA- 

CTCAGAAACCACACCCAAAGAACTTCGGATCAAGAGACAAAACTCCTCAGATAGCATCTC 

3841 + + + + + + 3900 

SETTPKELRIKRQNS SDSIS 

AAGCCTCAACAGCATCACTAGCCATTCCAGCATCGGCAGCAGCAAGGATGCTGATGCGAA 

3901 + + + + + + 3960 

SLNS ITSHSS IGSSKDADAK- 

AAAGAAGAAAAAAAAGAGTTGGCTTCGAAGTTCCTTCAACAAAGCGTTCAGTATAAAAAA 

3961 + + + + + + 4020 

KKKKKSWLRSSFNKAFS I K K - 

GGGGCCCAAGTCAGCTTCCTCATACTCGGATATAGAGGAGATTGCTACACCCGACTCTTC 

4021 + + + + + - + 4080 

GPKSASSYSDIEEIATPDSS- 

AGCCCCCTCATCCCCCAAACTACAGCATGGTTCTACAGAGACTGCTTCACCCTCCATCAA 

4081 + + + + + + 4140 

APSSPKLQHGSTETAS PSIK- 

GTCCTCCACCTCGTCCTCCGTGGGCACTGATGTCACCGAGGGCCCTGCTCACCCAGCCCC 

4141 + + + + + + 4200 

SSTSSSVGTDVTEGPAHPAP- 

CCACACTAGGCTGTTCCATGCAAATGAGGAGGAGGAGCCAGAGAAGAAGGAGGTATCGGA 

4201 + + + + + + 4260 

HTRLFHANEEEEPEKKEVSE - 

^GCTGCGCTCTGAGCTATGGGAGAAGGAAATGAAGCTTACAGACATCCGCTTGGAGGCCCT 

4261 + - + + + + + 4320 

LRSELWEKEMKLTDIRLEAL- 

CAACTCTGCCCACCAACTGGATCAGCTTCGGGAGACCATGCACAACATGCAGTTGGAGGT 

4321 + + + + + + 4380 

NSAHQLDQLRETMHNMQLEV- 

GGACCTGCTGGAAGCAGAGAATGACCGACTGAAGGTAGCCCCAGGCCCCTCATCAGGCTC 

4381 + + + + + — + 4440 

DLLEAENDRLKVAPGPSSQf. 



Fig- 16 (cont'd 4) 
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CACTCCAGGGCAGGTCCCTGGATCATCTGCATTATCTTCCCCACGCCGCTCCCTAGGCCT 

4441 + + + + + + 4500 

TPGQVPGSSALSSPRRSLGL - 

GGCACTCACCCATTCCTTCGGCCCCAGTCTTGCAGACACAGACCTGTCACCCATGGATGG 

4501 + + + + + + 4560 

ALTHS FGPSLADTDLS PMDG - 

CATCAGTACTTGTGGTCCAAAGGAGGAAGTGACCCTCCGGGTGGTGGTGAGGATGCCCCC 

4561 + + + + + + 4620 

ISTCGPKEEVTLRVVVRMPP 

GCAGCACATCATCAAAGGGGACTTGAAGCAGCAGGAATTCTTCCTGGGCTGTAGCAAGGT 

4621 + + + + + 7 + 4680 

Q H I I KGDLKQQEFF LGCSKV - 

CAGTGGAAAAGTTGACTGGAAGATGCTGGATGAAGCTGTTTTCCAAGTGTTCAAGGACTA 

4681 + + + + + + 4740 

SGKVDWKMLDEAVFQVFKDY - 

TATTTCTAAAATGGACCCAGCCTCTACCCTGGGACTAAGCACTGAGTCCATCCATGGCTA 

4741 + + + + + + 4800 

ISKMDPASTLGLSTES IHGY- 

CAGCATCAGCCACGTGAAACGAGTGTTGGATGCAGAGCCCCCCGAGATGCCTCCTTGCCG 

4801 + + + + + + 4860 

SISHVKRVLDAEPPEMPPCR- 

TCGAGGTGTCAATAACATATCAGTCTCCCTCAAAGGTCTGAAGGAGAAATGCGTCGACAG 

4861 + + + + + + 4920 

RGVNNI SVSLKGLKEKCVDS 

CCTGGTGTTCGAGACGCTGATCCCCAAGCCGATGATGCAGCACTACATAAGGCTCCTGCT 

4921 + + + + + + 4980 

LVFETLI PKPMMQHYI SLLL 

GAAGCACCGGCGCCTCGTCCTCTCGGGCCCCAGCGGCACGGGCAAGACCTACCTGACCAA 

4981 + + + + + + 5040 

KHRRLVLSGPSGTGKTYLTN- 

TCGCTTGGCCGAGTACCTGGTGGAGCGCTCTGGCCGTGAGGTCACAGAGGGCATCGTCAG 

5041 + + + + + + 5100 

RLAEYLV ERSGREVTEGIVS 

CACCTTCAACATGCACCAGCAGTCTTGCAAGGATCTGCAACTGTATCTTTCCAACCTAGC 

5101 + -f + " + + + 5160 

TFNMHQQSCKDLQLYLSNLA- 

"CAACCAGATAGACCGGGAAACAGGAATTGGGGATGTGCCCCTGGTGATTCTATTGGATGA 

5161 + + + + + + 5220 

NQIDRETGIGDVPLVI LLDD - 

CCTGAGTGAAGCAGGCTCCATCAGTGAGTTGGTCAATGGGGCCCTCACCTGCAAGTATCA 

5221 + + + + + + 5280 

LSEAGS I SELVNGALTCKYH 

TAAATGTCCCTATATTATAGGTACCACCAATCAGCCTGTAAAAATGACACCCAACCATGG 

5281 + + + + + 5340 

KCPYI I GTTNQPVKM T P N ~G 
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CTTTCACTTGAGCTTCAGGATGTTGACCTTCTCCAACAACGTGGAGCCAGCCAATGGCTT 

+ + + + + + 54Q0 

FHLS FRMLTFSNNVEPANG F 

CCTGGTTCGTTACCTGAGGAGGAAGCTGGTAGAGTCAGACAGCGACATCAATGCCAACAA 

+ + + + + + 546Q 

LVRYLRRKLVESDSDINANK- 



GGAAGAGCTGCTTCGGGTGCTCGACTGGGTACCCAAGCTGTGGTATCATCTCCACACCTT 
5461 + + + + + + 552Q 

EELLRVLDWVPKLWYHLHTF- 

CCTTGAGAAGCACAGCACCTCAGACTTCCTCATCGGCCCTTGCTTCTTTCTGTCGTGTCC 

5521 + + + + + + 5580 

L _ E KHSTSDFLIGPC FFLSCP- 

CATTGGCATTGAGGACTTCCGGACCTGGTTCATTGACCTGTGGAACAACTCTATCATTCC 

5581 + + + + + + 5640 

IGIEDFRTWFIDLWNNSIIP- 

CTATCTACAGGAAGGAGCCAAGGATGGGATAAAGGTCCATGGACAGAAAGCTGCTTGGGA 

5641 + + + + + + 5700 

YLQEGAKDGI KVHGQKAAWE - 

GGACCCAGTGGAATGGGTCCGGGACACACTTCCCTGGCCATCAGCCCAACAAGACCAATC 

5701 + + + + + + 5760 

DPVEWVRDTLPWPSAQQDQS 

AAAGCTGTACCACCTGCCCCCACCCACCGTGGGCCCTCACAGCATTGCCTCACCTCCCGA 

5761 + + + + + + 5820 

KLYHLPPPTVGPHSIASPPE- 

GGATAGGACAGTCAAAGACAGCACCCCAAGTTCTCTGGACTCAGATCCTCTGATGGCCAT 

5821 + + + + + + 5880 

DRTVKDSTPSSLDSDPLMAM- 

GCTGCTGAAACTTCAAGAAGCTGCCAACTACATTGAGTCTCCAGATCGAGAAACCATCCT 

5881 + + + + + + 5940 

LLKLQEAANYIESPDRETIL- 

GGACCCCAACCTTCAGGCAACACTTTAAGGGTTCGGCAATCACTGTCACCCCCGGACAGC 

5941 + + + + + + 6000 

DPNLQATL* 

AGAACGCTGGCATCAGCTATCTTAGCTCCTCCTCTCCCCTCTCCTCTTTCAGAGCACTGG 
6001 + + + ■ + + + 6060 

CTCTCCAGCC C C AGGAGGAGAACAGGAGGGAGGAGGAGATGAAAG AGG AGGGACAGGTTC 
6061 + + + + + + 6120 

TTGGTGCTGTACCTTTGAGAACTTCCTAGGAAGGAATGGTGGGGTGGCGTTTGGGAACTT 
6121 + + + + + + 6180 

GTGCCCCCTAAACACATTTACTGGCCTCCTCTAATGACTTTGGGGAAAAGATGATTCTGG 
6181 + + + + + + 6240 

GTCTTTCCCTTGACTTCTTGTTTCAATTACAAACTCCTGGGCTTTCTGGGGAGGGGTTCA 
6241 + ■ - + + + + Z"~~ + 6300 

;» 

Fig. 16 (cont ' d 6) 
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GAAAACATCAAAACACTGCAGCAGTTCCTAAATGATTCTCACAAGCAACCCTGAGAGAGA 
6301 + + + + + + 6360 

CAGTCTTGTGAGGGAGATCTGGGGGAGGCAGGAAGCTCCTCAGATTTTCTCACAGACCCT 
6361 + + + + + + 6420 

TCCCAATTCCATCACCACTGCCAACAACTCCTCCCCCAGAGATCTGGCTGGAGCCCAGAA 
6421 + + + + + + 6480 

AAAGAAGCATGTGGTTTAAAAAATGTTTAAATCAATCTGTAAAAGGTAAAAATGAAAAAC 
6481 + + + + + + 6540 

AAAAACAAGCAAACAAACAAAAAACAATGGAAAAGATGAAGCTGGAGAGAGAGGAACCAG 
6541 + + + + + : + 6600 

TTGCCAAGGTAGAGAGCTGCCCGCTCCTGCCCTCTGGATGACATAGGGGACATCAACAAG 
6601 + + + + + + 6660 

ACGGCTGCCAACCTGAGAAGTCACCAAACCACAAAAATAACCTTACAGCCTTCAGGGAAA 
6661 + + + + + + 6720 

GACTACCAGCTCTGTCTTTCTAC CCTCTAATTTAACAATGCATAAGAGTCAATAAAC C CT 
6721 + + + + + + 6780 

AC TTTTTTAAAAAAAAAAAAAAAAG . 
6781 + + 6805 
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T3-CDNA sequence and T3 protein 
encoded therein (protein isoform 1) 



CAACCAGCCAGAACGCCTGAACTCGCAGGTGCTGCAGGGGCTGCAGGAGCCAGCGGGGGA 
! + + + + + + go 

NQPERLNSQVLQGLQEPAGE- 

GGGG CTCCCGCTGCGGAAGAGCGGCTCGGTGGAAAACGGGTTCGATACCCAGATCTACAC 

61 + + + + + + 120 

GLPLRKSGSVENGFDTQIYT- 

AGACTGGGCCAATCATTACCTAGCCAAATCCGGCCACAAGCGTCTCATCAGGGATCTCCA 

121 + + + + + + 180 

DWANHYLAKSGHKRLIRDLQ- 

GCAAGATGTGACAGATGGCGTCCTCCTGGCCCAGATTATCCAGGTTGTGGCAAATGAAAA 

181 + + + + + + 240 

QDVTDGVLLAQI IQVVANEK- 

GATTGAAGACATCAATGGCTGTCCGAAGAACAGATCCCAAATGATTGAAAACATAGATGC 

241 + + + + + + 300 

IED INGCPKNRSQMIENIDA- 

CTGCTTGAATTTCCTGGCAGCTAAGGGAATAAACATCCAGGGGCTGTCTGCAGAAGAGAT 

301 + + + + + + 360 

CLNFLAAKGINIQGLSAEEI- 

CAGGAATGGAAACCTCAAGGCCATTCTAGGCCTCTTCTTCAGCCTCTCCCGATACAAGCA 

361 + + + + + + 420 

RNGNLKAILGLFFSLSRYKQ- 

GCAGCAGCAGCAGCCCCAGAAGCAGCACCTCTCCTCACCTCTGCCGCCCGCQGTATCCCA 

421 + + + + + + 480 

QQQQPQKQHLSSPLPPAVSQ- 

GGTGGCCGGGGCCCCCTCCCAGTGCCAGGCTGGCACCCCTCAGCAGCAGGTGCCAGTCAC 

481 + + + + + + 540 

VAGAPSQCQAGTPQQQVPVT - 

TCCCCAAGCCCCGTGCCAGCCTCACCAGCCAGCGCCACATCAGCAGTCAAAAGCACAAGC 

541 + + - + + + + 600 

PQAPCQPHQPAPHQQS K A Q A - 

TGAAATGCAGTCCAGACTTCCAGGTCCTACCGCGAGGGTATCCGCTGCAGGCAGCGAGGC 

601 + + + + + + 660 

EMQSRLPGPTARVSAAGSEA- 

CAAAACACGCGGAGGGTCAACTACTGCTAACAACCGACGCAGCCAGAGCTTTAACAACTA 

661 - + + + + + + 720 

KTRGGSTTANNRRSQS FNNY 

TGATAAATCCAAACCAGTCACCTCCCCACCCCCACCGCCAAGCAGCCACGAGAAAGAGCC 

721 + + + + + + 780 

DKSKPVTSPPPPPSSHEKEP- 

TTTGGCAAGTTCAGCCTCCTCCCACCCCGGAATGAGTGACAATGCACCTGCTTCCTTGGA 

781 + + + + + _---+ 840 

LASSASSHPGMSDNAPASL^E - 



Fig. 17 
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GAGCGGCAGCAGCTCCACCCCTACTAATTGCAGTACCTCCTCGGCCATCCCGCAGCCCGG 
h + + + + + 

SGSSSTPTNCSTSSAIPQPG 



TGCAGCCACCAAGCCTTGGCGCAGCAAATCCCTCAGCGTGAAGCACAGTGCCACGGTATC 
901 + + + + + -+ 960 

AATKPWRSKSLSVKHSATVS - 

CATGCTCTCGGTCAAGCCTCCTGGGCCTGAGGCCCCCAGGCCCACACCTGAAGCCATGAA 

961 + + + + + + 1020 

MLSVKPPGPEAPRPTPEAMK- 

GCCGGCCCCCAACAATCAGAAGTCCATGCTGGAAAAGCTGAAACTTTTCAACAGTAAAGG 

1021 + + + + + + 1080 

PAPNNQKSMLEKLKLFNSKG - 

GGGCTCAAAGGCAGGTGAGGGGCCGGGGTCCCGGGACACAAGCTGTGAGCGGCTGGAGAC 

1081 + + + + 4- + 1140 

GSKAGEGPGSRDTSCERLET 

TCTGCCCAGCTTCGAAGAGAGCGAGGAGCTGGAGGCCGCCAGTCGCATGCTCACCACCGT 

1141 + + + + + + 1200 

LPS FEESEELEAASRMLTTV- 

GGGCCCTGCTTCCAGCAGCCCCAAGATTGCACTCAAGGGCATTGCCCAGAGGACTTTTAG 

1201 + + + + + + 1260 

GPAS S S PKIALKGIAQRTFS - 

CCGGGCACTGACCAACAAGAAGAGTTCTCTGAAAGGCAATGAGAAAGAGAAGGAGAAACA 

1261 + + + + + + 1320 

RALTNK KS S LKGNE KE KEK Q 

ACAGCGGGAGAAGGATAAGGAGAAAAGCAAGGACCTTGCCAAGAGAGCCTCTGTGACGGA 

1321 + + + + + + 1380 

QREKDKEKSKDLAKRASVTE - 

GAGGCTGGACCTCAAGGAGGAGCCAAAAGAAGACCCCAGTGGAGCAGCTGTGCCCGAGAT 

1381 + + + + + + 1440 

RLDLKEEPKEDPSGAAVPEM- 

GCCAAAAAAGTCCTCCAAGATTGCCAGCTTCATCCCCAAAGGGGGGAAGCTCAACAGTGC 

1441 + + r + - + + + 1500 

PKKS SKIASFIPKGGKLNSA- 

CAAGAAGGAGCC C ATGG CC C CTTCC CACAGTGGAATACCAAAAC C AGGAATGAAGAGCAT 

1501 + + + + + + 1560 

KKEPMAPSHSGIPKPGMKSM- 

GCCCGGGAAATCCCCAAGTGCCCCAGCGCCTTCCAAGGAAGGGGAGCGGAGCCGGAGTGG 

1561 * + + + + + + 1620 

PGKS PSAPAPSKEGERSRSG 

GAAGCTGAGCTCAGGACTCCCCCAGCAGAAGCCCCAGCTGGACGGCAGACACTCCAGTTC 

1621 + + + + + + 1680 

KLS SGLPQQKPQLDGRHSSS 

CTCTTCCAGCCTGGCGTCCTCAGAAGGAAAAGGCCCAGGAGGGACCACCCTGAACCACAG 

1681 + + + + + + 1740 

SSS. LASSEGKGPGGTTLNffS 

^ — 

Fig. 17 (cont'd 1) 
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CATCAGCAGCCAGACTGTCAGTGGGTCTGTCGGGACCACCCAGACCACAGGAAGCAATAC 

1741 + + + + + ---+ 1800 

I SS QTVSGSVGTTQTTGSN T - 

CGTCAGTGTTCAGCTACCTCAGCCCCAGCAGCAATACAACCATCCCAACACTGCCACGGT 

1801 + + + + + + 1860 

VSVQLPQPQQQYNHPNTATV 

TGCACCTTTCCTGTACAGGTCTCAGACGGACACTGAAGGGAATGTTACTGCCGAGTCAAG 

1861 + + + + + + 1920 

APFLYRSQTDTEGNVTAESS 

CTCAACAGGTGTGAGCGTGGAGCCCAGCCACTTCACCAAGACTGGACAGCCTGCTCTGGA 

1921 + + + + + + 1980 

STGVSVEPSHFTKTGQPALE - 

AGAACTCACTGGGGAAGATCCTGAGGCTCGGCGGCTGCGGACAGTGAAGAACATCGCTGA 

1981 + + + + + + 2040 

ELTGEDPEARRLRTVKNIAD - 

TCTGCGGCAGAATTTGGAGGAAACCATGTCCAGTTTAAGGGGAACTCAGGTTACACACAG 

2041 + + + + + + 2100 

LRQNLEETMSSLRGTQVTHS - 

C ACATTGGAAAC CACGTTTG AC ACCAATGTC ACCACGGAGATGAGTGGC CGTAGCATACT 

2101 + + + + + + 2160 

TLETTFDTNVTTEMSGRSIL- 

CAGCTTGACAGGGAGGCCCACACCTCTGTCCTGGAGACTGGGCCAGTCCAGCCCTCGGCT 

2161 + --" + + + + + 2220 

SLTGRPTPLSWRLGQSSPRIi- 

CCAAGCAGGAGACGCCCCCTCAATGGGCAATGGGTATCCCCCTCGAGCCAACGCCAGCAG 

2221 + + + + + "\ + 2280 

QAGDAPSMGNGYPPRANASR- 

GTTCATCAACACTGAGTCAGGTCGCTATGTGTACTCCGCCCCTCTGAGAAGGCAGCTGGC 

2281 + + + + + + 2340 

FINTESGRYVYSAPLRRQLA- 

CTCCCGGGGCAGTAGTGTCTGCCACGTGGACGTCTCAGACAAGGCAGGAGATGAGATGGA 

2341 + + - + + + + 2400 

SRGS SVCHVDVSDKAGDEMD 

CCTGGAAGGCATCAGCATGGACGCCCCCGGCTACATGAGCGATGGGGATGTTCTGAGCAA 

2401 + + + + + + 2460 

LEGISMDAPGYMSDGDVLSK- 

GAACATCCGGACCGATGACATTACAAGCGGATACATGACTGATGGTGGACTTGGCCTCTA 

2461 * + + + + + + 2520 

NIRTDDITSGYMTDGGLGLY- 



TACCCGTCGCCTGAACCGGCTCCCTGATGGGATGGCTGTGGTACGGGAGACCCTGCAACG 

+ + + + + + 2£ 

TRRLNRLPDGMAVVRETLQR- 



AAATACCTCCCTGGGCCTCGGAGACGCTGACAGCTGGGACGACAGCAGCTCCGTCAGCAG 

2581 + + + + + + 2640 

NTSLGLGDADSWDDSSSV5S- 
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CGGCATCAGCGACACCATAGACAACCTCAGCACTGATGACATCAACACCAGCTCCTCCAT 

2641 + + + + + + 2700 

GISDTIDNLSTDDINTSSS I 

CAGCTCTTATGCCAACACACCTGCCTCCTCTCGAAAAAACCTGGATGTGCAGACTGATGC 

2701 + + + + + + 2760 

S SYANTPAS SRKNLDVQTDA- 

TGAGAAGCACTCACAGGTGGAGAGGAATTCCCTGTGGTCTGGTGATGATGTCAAGAAATC 

2761 + + + + + + 2820 

EKHSQVERNSLWSGDDVKKS 

AGACGGAGGCTCAGACAGCGGCATAAAAATGGAGCCAGGTTCCAAGTGGAGGCGGAATCC 

2821 + + + + + + 2880 

DGGSDSGIKMEPGSKWRRNP- 

TTCTGATGTGTCTGACGAGTCCGACAAAAGCACGTCGGGCAAGAAGAATCCTGTCATCTC 

2881 + + + + + + 2940 

SDVSDESDKSTSGKKNPVIS - 

CCAGACAGGCTCATGGCGGCGAGGCATGACAGCTCAGGTGGGCATCACCATGCCAAGGAC 

2941 + + + + + + 3000 

QTGSWRRGMTAQVGITMPRT- 

GAAGGCTTCAGCCCCGGCAGGCGCACTGAAGACCCCAGGAACTGGAAAAACAGACGACGC 

3001 + + + + + + 3060 

KASAPAGALKTPGTGKTDDA- 

AAAGGTGTCTGAGAAAGGAAGGCTTTCTCCTAAAGCCTCCCAGGTGAAGCGCTCCCCATC 

3061 + + + + + + 3120 

KVSEKGRLSPKASQVKRSPS 

AGATGCAGGCCGGAGCAGTGGTGACGAATCCAAAAAGCCCCTCCCCAGCAGCTCTAGGAC 

3121 + + + + + -T- + 3180 

DAGRSSGDESKKPLPS S SRT 

ACCTACTGCCAATGCCAACAGCTTTGGGTTCAAGAAGCAGAGTGGTTCCGCCACCGGCCT 

3181 + + + + + + 3240 

PTANANS FGFKKQSGSATGL - 

GGCCATGATCACAGCCAGCGGGGTGACTGTCACCAGCAGGTCAGCCACACTGGGCAAAAT 

3241 + + - + + - -- + + 3300 

AMITASGVTVTSRSATLGKI 

CCCAAAGTCATCTGCACTCGTCAGTCGGTCTGCTGGTCGGAAGTCAAGTATGGATGGGGC 

3301 + + + + + + 3360 

PKSSALVSRSAGRKSSMDGA- 

TCAGAATCAGGATGACGGGTATCTAGCCCTAAGCTCCCGGACAAACCTTCAGTACCGGAG 

3361 * + + + + + + 3420 

QNQDDGYLALSSRTNLQYRS - 

TTTGCCGAGGCCCAGTAAGTCCAACAGCCGGAACGGGGCTGGGAACAGGTCTAGCACCAG 

3421 + + + + + + 3480 

LPRPSKSNSRNGAGNRSSTS - 

CAGCATAGATTCCAACATTAGCAGCAAGTCCGCAGGCCTGCCAGTGCCCAAACTGAGGGA 

3481 + + + + - + + 3540 

SIDSNISSKSAGLPVPKLtfE- 
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GCCTTCCAAAACAGCCCTAGGCAGCTCTCTACCAGGTCTGGTCAACCAAACAGACAAGGA 
3541 + + + + + + 3600 

PSKTALGSSLPGLVNQTDK 'E- 

GAAAGGCATCTCATCAGACAACGAGAGTGTGGCTTCCTGTAACTCGGTGAAAGTGAATCC 
360 l + + + + + + 366Q 

KGISSDNESVASCNSVKVNP- 

GGCAGCCCAGCCTGTGTCCAGTCCGGCTCAGACCAGTCTCCAGCCTGGAGCCAAGTACCC 

3661 + + + + + + 3720 

AAQPVSSPAQTSLQPGAKYP 

AGATGTGGCCTCTCCCACACTCCGCAGACTCTTTGGTGGGAAGCCTACCAAGCAAGTGCC 
3721 + + + + + + 3780 

DVAS PTLRRLFGGKPTKQVP 

CATCGCCACAGCTGAAAACATGAAAAATTCGGTGGTCATCTCCAATCCTCATGCCACCAT 

3781 + + + + + + 3840 

IATAENMKNSVVI SNPHATM 

GACTCAGCAAGGTAACCTAGACTCCCCGTCAGGCAGTGGCGTCCTGAGCAGTGGGAGCAG 

3841 + + + + + + 3900 

TQQGNLDSPSGSGVLSSGSS - 

CAGTCCTCTCTACAGCAAGAATGTGGACCTCAACCAGTCTCCGCTAGCCTCCAGCCCCAG 

3901 + + + + + + 3960 

SPLYSKNVDLNQSPIiASSPS - 

CTCAGCCCACTCGGCCCCTTCCAACAGCCTCACCTGGGGCACCAACGCCAGCAGCTCCTC 

3961 + + + + + + 4020 

SAHSAPSNSLTWGTNASSSS- 

CGCAGTTAGCAAGGATGGCCTGGGCTTTCAGTCTGTCAGCAGCCTCCACACCAGCTGTGA 

4021 + + + + + -,- + 4080 

AVSKDGLGFQSVSSLHTSCE- 

GTCCATCGACATCTCCCTCAGCAGTGGAGGGGTCCCCAGCCACAATTCTTCCACTGGCCT 

4081 + + + + + + 4140 

SIDISLSSGGVPSHNSSTGL- 

C ATCGCCTCCTC CAAGGACGACTCCTTGACTC CCTTTGTCAGAACTAACAGTGTGAAGAC 

4141 + + - + + + + 4200 

IASSKDDSLTPFVRTNSVKT- 

CACACTGTCAGAAAGCCCTCTCTCTTCCCCTGCTGCTAGCCCTAAGTTCTGCAGAAGTAC 

4201 + + + + + + 4260 

TLSESPLSSPA ASPKFCRST- 

TCTGCCCAGGAAACAGGACAGTGACCCGCACCTTGATAGGAACACTTTGCCTAAGAAAGG 

4261" + + + + + + 4320 

LPRKQDSDPHLDRNTLPKKG- 

ACTCAGGTATACTCCCACCTCCCAGCTTCGCACGCAAGAAGATGCAAAAGAATGGTTACG 

4321 + + + + + + 4380 

LRYTPTSQLRTQEDAKEWLR- 

GTCCCATTCTGCAGGAGGCCTTCAGGACACCGCTGCCAATTCCCCCTTTTCCTCTGGCTC 
4381 + + + + + + 4440 

SHSAGGLQDTAANSPFSSGS 

« — 

Fig. 17 (cont'd 4) ■ 
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CAGCGTGACTTCTCCCTCCGGAACAAGATTCAACTTTTCCCAGCTTGCGAGTCCCACCAC 
-l + + + H + 

SVTS PSGTRFNFSQLAS P T " T 



TGTCACCCAGATGAGCTTGTCCAACCCGACCATGCTGAGGACTCACAGCCTCTCCAATGC 
4501 + + + + + + 456Q 

VTQMSLSNPTMLRTHSLSNA- 

TGATGGGCAGTATGATCCATACACTGACAGCCGCTTCCGGAATAGCTCCATGTCCCTGGA 
4561 + + + + + + 462Q 

DGQYDPYTDSRFRNSSMSLD- 

TGAGAAGAGCAGAACCATGAGCCGTTCAGGCTCATTCCGGGATGGGTTT.GAAGAAGTTCA 
4621 + + + + + + 4680 

EKSRTMSRSGSFRDGFEEVH- 

TGGATCCTCACTCTCCTTGGTTTCCAGCACATCGTCAGTTTATTCTACACCAGAAGAAAA 

4681 + + + + + + 4740 

GSSLSLVSSTSSVYSTPEEK- 

ATGCCAGTCAGAGATTCGCAAGCTGCGGCGGGAACTGGATGCCTCCCAGGAGAAAGTTTC 

4741 + + + + + + 4800 

CQSE IRKLRRELDASQEKVS 

AGCTTTGACCACCCAGCTGACAGCAAATGCTCACCTTGTGGCTGCCTTTGAACAGAGTCT 

4801 + + + + + + 4860 

ALTTQLTANAHLVAAFEQSL - 

TGGTAACATGACAATCAGGCTCCAGAGTCTGACCATGACAGCTGAGCAGAAGGATTCAGA 

4861 + + + + + + 4920 

GNMTIRLQSLTMTAEQKDSE 

ACTGAATGAGTTAAGAAAAACCATTGAGCTGCTAAAGAAACAGAACGCAGCTGCCCAGGC 

4921 + + + + + + 4980 

LNELRKTI ELLKKQNAAAQA- 

TGCCATTAATGGAGTAATTAACACACCTGAGCTCAACTGCAAAGGAAACGGCACTGCCCA 

4981 + + + + + + 5040 

AINGVINTPELNCKGNGTAQ- 

GTCTGCAGACCTCCGCATCCGCAGGCAGCACTCCTCAGACAGCGTCTCCAGCATCAACAG 

5041 + + - + + + + 5100 

SADLRIRRQHSSDSVSS INS - 

TGCCACCAGCCACTCCAGTGTGGGCAGCAACATAGAGAGTGACTCAAAGAAGAAGAAGAG 

5101 + + + + + + 5160 

ATSHSSVGSNIESDSKKKKR- 

GAAGAACTGGGTCAATGAGTTACGCAGCTCCTTCAAGCAAGCTTTCGGGAAGAAGAAGTC 

5161 + + + + + + 5220 

KNWVNELRSS FKQAFGKKKS - 

CCCAAAATCTGCGTCCTCTCATTCAGATATTGAGGAGATGACGGATTCTTCTTTGCCTTC 

5221 + + + + + + 5280 

PKSASSHSDIEEMTDSSLPS- 

CTCACCAAAGTTACCACACAATGGGTCCACAGGTTCCACCCCACTGCTGAGGAATTCTCA 

5281 + + + + + + 5340 

SPKLPHNGSTGSTPLLRNSH- 
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CTCCAACTCTCTAATTTCAGAATGCATGGATAGTGAAGCTGAGACCGTCATGCAGCTCCG 

+ + + + + + 5400 

SNSL I SECMDSEAETVMQL""R 

AAATGAGTTAAGAGACAAGGAGATGAAGCTGACAGATATCCGCTTAGAAGCTCTCAGTTC 

+ + + + + + 5460 

NELRDKEMKLTDIRLEALSS - 

TG CC CACCAGCTGGACCAGCTCCGGGAGGCCATGAACAGGATGCAGAGTGAAATAG AGAA 

+ + + + + + 5520 

AHQLDQLREAMNRMQS E I E K 



GCTGAAAGCTGAGAATGATCGGCTGAAGTCAGAGTCTCAAGGCAGTGGCTGCAGCCGGGC 

5521 + + + + + + 5580 

LKAEMDRLKSESQGSGCSRA- 

TCCTTCCCAAGTGTCCATCTCTGCCTCCCCGAGGCAGTCCATGGGCCTCTCCCAGCACAG 

5581 + + + + + + 5640 

PSQVSISASPRQSMGLSQHS- 

CTTGAACCTCACTGAGTCAACCAGCCTGGACATGTTGCTGGATGACACTGGTGAATGCTC 

5641 + + + + + + 5700 

LNLTESTS LDMLLDDTGECS 

GG CTCGGAAGGAAGGAGGCAGG C ATGTTAAGATAGTTGTCAGCTTTC AGGAGGAAATGAA 
5701 + + + + + + 5760 

ARKEGGRHVKIVVSFQEEMK- 

GTGGAAGGAGGATTCCAGACCACATCTCTTTCTTATTGGCTGCATTGGAGTTAGTGGCAA 
5761 + + + + + + 5820 

WKEDSRPHLFLIGCIGVSGK- 

GACGAAGTGGGATGTGCTCGATGGGGTGGTTAGACGGCTGTTCAAAGAATACATCATTCA 

5821 + + + + + - T + 5880 

TKWDVLDGVVRRLFKEYIIH- 

TGTCGACCCAGTGAGTCAGCTAGGGCTGAATTCAGACAGCGTTCTTGGCTACAGCATTGG 

5881 + + + + + + 5940 

VDPVSQ LGLNSDSVLGYSIG- 

AGAAATCAAGCGCAGCAACACTTCCGAAACACCGGAGCTGCTTCCTTGTGGCTATCTGGT 

5941 + + + + + + 6000 

EIKRSNTS ETPELLPCGYLV- 

TGGAGAGAACACGACCATCTCAGTGACTGTGAAAGGGCTCGCAGAAAACAGCCTGGACTC 

6001 + + + + + + 6060 

GENTTISVTVKGLAENSLDS - 

ACTGGTGTTTGAGTCCTTGATTCCCAAGCCCATCCTGCAGCGCTACGTCTCCCTCCTGAT 

6061 * + + + + + + 6120 

LVFESLIPKPILQRYVSLLI 

AGAGCACCGTCGGATCATTCTCTCTGGCCCCAGCGGCACTGGGAAAACCTACCTGGCCAA 

6121 + + + + + + 6180 

EHRRI ILS GPSGTGKTYLAN- 

CCGGCTGTCTGAGTATATAGTGCTTCGAGAGGGACGGGAGTTGACAGACGGGGTTATCGC 

6181 + + + + + + 6240 

RLSEYIVLREGRELTDGVfA- 
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CACCTTTAACGTGg?TCCATAAGTCCAGCAAGGAATTGCGCCAGTACCTGTCCAACCTTGC 

6241 + + + + + + 6300 

TFNVDHKS S KELRQYLSNLA- 

TGACCAGTGCAACAGTGAGAACAATGCTGTGGACATGCCCCTCGTCATCATCCTGGACAA 

6301 + + + + + + 6360 

DQCNSENNAVDMPLVI I L D N - 

CCTACACCACGTGAGCTCTCTGGGCGAGATCTTCAATGGGCTGCTCAACTGCAAGTACCA 

6361 + + + + + + 6420 

GGATGTGGTGCACTCGAGAGACCCGCTCTAGAAGTTACCCGACGAGTTGACGTTCATGGT 
LHHVSSIjGEIFNGLLNCKYH- 

CAAATGCCCTTACATAATTGGCACAATGAACCAGGCTACCTCTTCGACTCCCAACCTGCA 

6421 + + + + + + 6480 

GTTTACGGGAATGTATTAACCGTGTTACTTGGTCCGATGGAGAAGCTGAGGGTTGGACGT 
KCPYI IGTMNQATS-STPNLQ 

GCTTCACCATAACTTCAGATGGGTGCTTTGTGCCAACCACACGGAGCCTGTGAAGGGTTT 

6481 + + -t- + + + 6540 

CGAAGTGGTATTGAAGTCTACCCACGAAACACGGTTGGTGTGCCTCGGACACTTCCCAAA 
LHHNFRWVLCANHTEPVKGF - 

CCTTGGCCGATTCCTGAGGAGGAAGCTCATGGAAACAGAGATCAGTGGGCGGGTGCGCAA 

6541 + + + + + + 6600 

GGAACCGGCTAAGGACTCCTCCTTCGAGTACCTTTGTCTCTAGTCACCCGCCCACGCGTT 
LGRFLRRKLMETEISGRVRN- 

TATGGAGCTGGTAAAAATCATTGACTGGATTCCCAAGGTCTGGCATCACCTCAACCGCTT 

6601 + + + + + + 6660 

ATACCTCGACCATTTTTAGTAACTGACCTAAGGGTTCCAGACCGTAGTGGAGTTGGCGAA 
MELVKIIDWIPKVWHHLNRF- 

CCTGGAGGCTCACAGTTCCTCGGACGTCACCATCGGCCCCCGGCTCTTCCTGTCATGCCC 

6661 + + + + + - + 6720 

LEAHS SSDVTIGPRLFLSCP 

CATCGATGTGGACGGCTCGAGAGTGTGGTTCACCGACTTGTGGAACTATTCCATTATCCC 

6721 + + + + + + 6780 

IDVDGSRVWFTDLWNYSIIP- 

CTATCTCCTGGAAGCCGTCAGAGAAGGACTCCAGCTCTATGGAAGGCGCGCCCCCTGGGA 

6781 + + + + + + 6840 

YLLEAVREGLQLYGRRAPWE - 

GGATCCTGCCAAGTGGGTGATGGACACATATCCATGGGCAGCCAGCCCACAACAGCACGA 

6841 + + + + + + 6900 

PPAKWVMDTYPWAAS PQQHE - 

GTGGCCTCCCCTGCTGCAGTTACGGCCTGAGGATGTCGGCTTCGACGGCTACTCCATGCC 

6901 + + + + + + 6960 

WPPLLQLRPEDVGFDGYSMP- 

TCGGGAGGGATCGACAAGCAAGCAGATGCCCCCCAGTGATGCTGAAGGTGACCCGCTGAT 

6961 + + + + + + 7020 

REGSTSKQMPPSDAEGDPLM 

GAACATGCTGATGAGGCTGCAGGAGGCAGCCAACTACTCCAGCCCCCAGAGCTATGACAG 

7021 + + + + + + 7080 

'NMLMRLQEAANYSS PQSYQ.S 
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cgactccaacagSacagccatcacgatgacatcttggactcctctttggagtccactct 

7081 + + + + + + 7140 

DSNSNSHHDDILDSSLESTL- 

GTGACAGGGGCCCGGAGCCCAGCGCCCTCCTCTTCTCCTCACCGCATTCCACCTGCATCC 
7141 + + + + + + 7200 



CCCACATCACCCTGAAGATGACTTCCTGAGCCAGCCCCCAGCCACAGCCTTAGAGCTGCG 
7201 + + + + + + 7260 

GGAACACCGAGACCCCCCGTCCTTCAGCCTCGACCTGGGTGCAGGCATCCCGGGCCAGCT 
7261 + + + + + -+ 7320 

GCCTGCGGACCGCTTCCTTCCACAGCGAGAACTGCACTACCTTCTGTTGTACTTTAATTA 
7321 + + + + + + 7380 

TTGTTTTGCCTTGTTGCTGTGACCTCCCTAAGACACTGAAGATACTTCTCGGGAAAGGAT 
7381 + + + + ■ + + 7440 

CATCGCCGTTGAAATGAAAAGAGAGACAGAGAGAGAAAAAAAAAAGAGAACCCACATGAA 
7441 + + + + + + 7500 

GCTCTGAAACCAAACAGCATCCTGCCATGAGCTTCCCAGAGACAGAAGAGACTGGAGCAA 
7501 + + + + + + 7560 

AGTCGGAAACACAGAGAAGCACGGCTTCCCCTCAGCACAGACCCTCCAGACTGGGTCTCA 
7561 + + + + + + 7620 

GAGCCGTGCCACCCACCCTCCCACACAGCCGGCCACAGGGAGAACTGGTGCTAACCAGGG 
7621 + + + + + + 7680 

TGCTTGCTTTGGTCACGTTCAACGCACTACAGAGCTACGACACAGGGGAACCTTAGGAGC 
7681 + + + + + - + 7740 

AAATAAACCGTGCTTTCATGTTTTTTAAAAAAAAAAAAAAAAA 
7741 + + + + 7783 
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T3-CDNA sequence and T3 protein encoded 
therein (isoform 2) 



AGCAGGGAGAGGGGAGGGAGTGTGCCGTCTCTTCTGCAAGGGCAGTGCCCCAGCCTCAGC 



1 + + + + + + 60 

SRERGGSVPSLLQGQCPSLS 

CACACTTCTGATCTGCAGTCCAACAGACCTTTCTAGCATGCCAAAGAGAACCTGGGGGTG 

61 + + + + + - + 120 

HTSDLQSNRPF*HAKENLGV 

CCAGGGGGTCCTCAGAGCTCACACTGCACTTGTGGCACCCACAGCGAGTAGCCATCCGTG 

121 + + + + + + 180 

PGGPQSSHCTCGTHSE*PSV 

AGCCGAGGAAACTGTACACAGATCTACACAGACTGGGCCAATCATTACCTAGCCAAATCC 

181 + + - + + + + 240 

SRGNCTQI YTDWANHYLAKS 

GGCCACAAGCGTCTCATCAAGGATCTCCAGCAAGATGTGACAGATGGCGTCCTCCTGGCC 

241 + + + + + + 300 

GHKRLIKDLQQDVTDGVLLA 

CAGATTATCCAGGTTGTGGCAAATGAAAAGATTGAAGACATCAATGGCTGTCCGAAGAAC 

301 + + + + + + 360 

QIIQVVANEKIEDINGCPKN 

AGATC C CAAATGATTGAAAACATAGATGC CTGCTTGAATTTCCTGGCAGCTAAGGGAATA 

361 + + + + + + 420 

RSQMIENIDACLNFLAA KGI 

AACATCCAGGGGCTGTCTGCAGAAGAGATCAGGAATGGAAACCTCAAGGCCATTCTAGGC 

421 + + + + + + 480 

NIQGLSAEEIRNGNLKAILG 

CTCTTCTTCAGCCTCTCCCGATACAAGCAGCAGCAGCAGCAGCCCCAGAAGCAGCACCTC 

481 + + + + + + 540 

LFFSLSRYKQQQQQPQKQHL 

TCCTCACCTCTGCCGCCCGCCGTATCCCAGGTGGCCGGGGCCCCCTCCCAGTGCCAGGCT 

541 + + + + + --- + 600 

SSPLPPAVSQVAGAPSQCQA 

GGCACCCCTCAGCAGCAGGTGCCAGTCACTCCCCAAGCCCCGTGCCAGCCTCACCAGCCA 

601 + + + + + + 660 

GTPQQQVPVTPQAPCQPHQP 
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T3 murine cDNA 



ATGAGAAGAGCCGAACAATGAGTCGGTCAGGCTCCTTCCGGGATGGGTTTGAGGAAGTTC 

! + + + + + + 60 

EKS RTMSRSGS FRDGFEEVH- 

ATGGATCCTCCCTGTCCTTGGTTTCCAGCACATCCTCCATCTACTCCACGCCAGAAGAAA 

61 + + + + + + 120 

GSSLSLVSSTSSIYSTPEEK- 

AATGCCAGTCAGAGATTCGAAAGCTGAGGCGAGAACTGGATGCCTCCCAGGAAAAGGTGT 

121 + + + + + + 180 

CQSEIRK-LRRELDASQEKVS- 

AATGC C AGT C AGAGATT CGAAAG CTGAGG CGAG ACGTG G ATG C CTC C C AGG AAAAGGTGT 

121 + + + + + + 180 

CQS E IRKLRRDVDASQE KVS- 

CTGCGCTGACTACCCAGCTGACTGCAAATGCTCACCTTGTGGCAGCCTTCGAGCAGAGTC 

181 + + + + + + 240 

ALTTQLTANAHLVAAFEQSL- 



TGGGAAACATGACCATCAGGCTACAGAGTTTAACTATGACCGCTGAGCAGAAGGATTCAG 

+ + + + + + 3( 

GNMTIRLQSLTMTAEQKDSE- 

AACTGAACGAGTTAAGAAAAACCATCGAGCTGCTGAAGAAACAGAATGCAGCTGCCCAGG 

+ + + + + + 3( 

LNE LRKTI ELL KKQNAAAQA- 



CTG C CATTAATGGAGTG ATTAACACGC CAGAGCTCAACTGC AAAGGAAATG GC AGTGC C A 

361 + + + + + + 420 

AIN GVINTPELNCKGNGSAR- 



GGCTACAGACCTACGCATCCGCAGCAACACTCCTCCGACAGTGTCTCCAGTATCAATAGC 

+ + + + + + 

LQTYASAATLLR QCLQYQ * R 



GCCACCAGCCACTCAAGTGTG 

481 + + ~ 501 

H Q P L K C 



Fig, 19 
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CAGCCTCTCCAACCGCTCGTA 
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•T2 97/124 Mjj^ 

CCTCCTCCACCTACTCCTCAC ^^AG 



SLSNRS ASSTYSS 



lGAAGAAAAAAAAGAGTTGGC 
K K K K S W 



AGCCTCTCCAACCGCTCGTAC 
S L S N R S Y 

GGCT CT CC AACCGCT CGT AC C 
S L S N R S Y 

CCTCTCCAACCGCTCGTACCC 
It S N R S Y 

CTCTCCAACCGCTCGTACCCT 
L S N R S Y P 

TCTCCAACCGCTCGTACCCTC 
L S N R S Y P 

CTCCAACCGCTCGTACCCTCT 
S N R S Y P 

TC CAACCGCT CGT ACCCT CTG 
S N R S Y P L 

CCAACCGCT CGT ACCCT CTGT 
S N R S Y P L 

CAACCGCT CGT AC CCT CTGTC 
N R S Y P L 

AACCGCT CGT ACCCTCTGTCA 
N R S Y P L S 

ACCGCTCGTAC CCT CTGT CAT 
N R S Y P L S 

CCGCTCGTACCCTCTGTCATG 
R S Y P Ii S 

CGCTCGTACCCTCTGTCATGG 
R S Y P L S W 

GCTCGTACCCTCTGTCATGGC 
S Y P I» S W 

CTCGTACCCTCTGTCATGGCG 
S Y P L S W 

TCGTACCCTCTGTCATGGCGC 
S Y P L S W R 

CGTACCCT CTGTC ATGGCGCT ' 
S Y P L S W R 

GTACCCTCTGTCATGGCGCTA 
Y P L S W R 

TACCCTCTGTCATGGCGCTAT 
Y P Ii S W R Y 



CTCCTCCACCTACTCCTCACA 
S S T Y S S 

TCCTCCACCTACTCCTCACAA 
S S T Y S S Q 

CCTCCACCTACTCCTCACAAA 
S S T Y S S Q 

CT CC AC CT ACT CCT CAC AAAT 
S T Y S S Q 

T C C AC CT ACTCCT CACAAAT C 
S T Y S S Q I 

CCACCTACTCCTCACAAATCC 
S T Y S S Q I 

CACCT ACT CCT CAC AAATCCG 
T Y S S Q.I 

AC CT ACT CCT CACAAATCCGG 
T Y S S Q I R 

CCT ACT CCT CACAAATCCGGA 
T Y S S Q I R 

CTACTCCTCACAAATCCGGAA 
Y S S Q I R 

TACT CCT CACAAATCCGGAAG 
Y S S Q I R K 

ACT C CTC ACAAAT C CGGAAGC 
Y S S Q I R K 

CTCCTCACAAATCCGGAAGCT 
S S Q I R K 

T CCT CACAAAT CCGGAAGCTT 
S S Q I R K L 

CCTCACAAATCCGGAAGCTTC 
S S Q I R K L 

CTCACAAATCCGGAAGCTTCG 
S Q I R K L 

TCACAAATCCGGAAGCTTCGT 
S Q I R K L R 

CACAAAT C CGGAAG CTT CGT A 
S Q I R K L R 

ACAAAT CCGGAAGCTT CGT AG 
Q I R K L R 



GAAGAAAAAAAAGAGTTGGCT 
K K K K S W 

AAGAAAAAAAAGAGTTGGCTT 
K K K K S W L 

AGAAAAAAAAG AGTT GGCTT C 
K K K K S W L 

G AAAAAAAAGAGT TGGCT T CG 
K K K S W L 

AAAAAAAAGAGTTGGCTTCGA 
K K K S W L R 

AAAAAAAGAGTTGGCTTCGAA 
K K K S W L R 

AAAAAAGAGTTGGCTT CGAAG 
K K S W L R 

AAAAAGAGTTGGCTTCGAAGT 
K K S W L R S 

AAAAGAGTTGGCTTCGAAGTT 
K K S W I* R S 

AAAGAGTTGGCTTCGAAGTTC 
K S W L R S 

AAGAGTTGGCTTCGAAGTTCC 
K S W L R S S 

AGAGTTGGCTTCGAAGTTCCT 
K S W L R S S 

GAGTTGGCTTCGAAGTTCCTT 
S W L R S S 

AGTTGGCTTCGAAGTTCCTTC 
S W L R S S F 

GTTGGCTTCGAAGTTCCTTCA 
S W L R S S F 

TTGGCTTCGAAGTTCCTTCAA 
W L R S S F 

T GGCTT CGAAGTTCCTT CAAC 
W L R S S F N 

GGCTTCGAAGTTCCTTCAACA 
W L R S S F N 

GCTTCGAAGTTCCTTCAACAA 
L R S S F N 



ACCCTCT GTC AT GGCGCT AT G 
Y P L S W R Y 



Fig. 20 



T2 

CTCCATCAAGTCCTCCACCTC 
S I K S S T 

TCCATCAAGTCCTCCACCTCG 
S I K S S T S 

CCATCAAGTCCTCCACCTCGT 
S I K S S T S 

CATCAAGTCCTCCACCTCGTC 
I K S S T S 

ATCAAGTCCTCCACCTCGTCC 
I K S S T S S 

T CAAGT C CTCCAC CT CGT CCT 
I K S S T S S 

CAAGTCCTCCACCTCGTCCTC 
K S S T S S 

AAGTCCTCCACCTCGTCCTCC 
K S S T S S S 

AGTCCTCCACCTCGTCCTCCG 
K S S T S S S 

GTCCTCCACCTCGTCCTCCGT 
S S T S S S 

TCCTCCACCTCGTCCTCCGTG 
S S T S S S V 

CCTCCACCTCGTCCTCCGTGG 
S S T S S S V 

CTCCACCTCGTCCTCCGTGGG 
S T S S S V 

TCCACCTCGTCCTCCGTGGGC 
S T S S S V G 

CCACCTCGTCCTCCGTGGGCA 
T S S S V G 

CACCT CGT CCT C CGT GGGCAC 
T S S S V G 

ACCTCGTCCTCCGTGGGCACT 
T S S S V G G 

CCT CGT C CT C CGTGGGCACTG 
T S S S V G G 

CTCGTCCTCCGTGGGCACTGA 
S S S V G G 

TCGTCCTCCGTGGGCACTGAT 
S S S V G G T 

CGTCCTCCGTGGGCACTGATG 
S S S V G G T 



T2 
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AGTTGGAGGT GGAC CTGCT GG 
L E V D L L 




ATGACACCCAACCATGGCTTT 
M T P N H G F 



GTTGGAGGTGGACCTGCTGGA 
I* E V D L L 



TGACACCCAACGATGGCTTTC 
M T P N H G F 



TTGGAGGTGGACCTGCTGGAA 
L E V D L L E 



GACACCCAACCATGGCTTTCA 
T P N H G F 



TGGAGGTGGACCTGCTGGAAG 
L E V D L L E 



ACACCCAACCATGGCTTTCAC 
T P N H G F H 



GGAGGTGGACCTGCTGGAAGC 
E V D L L E 



CACCCAACCATGGCTTTCACT 
T P N H G F H 



GAGGTGGACCTGCTGGAAGCA 
E V D L L E A 



ACCCAACCATGGCTTTCACTT 
P N H G F H 



AGGTGGACCTGCTGGAAGCAG 
E V D L It E A 



CCCAACCATGGCTTTCACTTG 
P N H G F H L 



GGTGGACCTGCTGGAAGCAGA 
V D L L E A 



CCAACCATGGCTTTCACTTGA 
P N H G F H L 



GTGGAC CTGCTGGAAGCAGAG 
V D £* X* E A E 



CAACCATGGCTTTCACTTGAG 
N H G F H L 



TGGACCTGCTGGAAGCAGAGA 
V D L L E A E 



AACCATGGCTTTCACTTGAGC 
N H G F H L S 



GGAC CT GCT GG AAGC AGAGAA 
D L L E A E 



ACCATGGCTTTCACTTGAGCT 
N H G F H L S 



G ACCT GCT GG AAGCAGAGAAT 
D L I. E A E N 



CCATGGCTTTCACTTGAGCTT 
H G F H L S 



AC CTGCTGGAAGCAGAGAATG 
D L L E A E N 



CATGGCTTTCACTTGAGCTTC 
H G F H L S F 



CCTGCTGGAAGCAGAGAATGA 
L L E A E N 



ATGGCTTTCACTTGAGCTTCA 
H G F H L S F 



CT GCT GG AAGCAGAGAAT GAC 
L L E A E N D 



TGGCTTT CACTTGAGCTTCAG 
G F H L S F 



TGCTGGAAGCAGAGAATGACC 
L L E A E N D 



GGCTTT CACTTGAGCTTCAGG 
G F H L S F R 



GCTGGAAGC AGAGAATG AC CG 
I* E A E N D 

CT GG AAGCAGAGAAT GACCGA 
L E A E N D R 

TGGAAGCAGAGAATGACCGAC 
L E A E N D R 



GGAAGCAGAGAATGACCGACT 
E A E N D R 

GAAGCAGAGAATGAC CG ACT G 
E A E N D R L 



GCTTTCACTTGAGCTTCAGGA 
G F H L S F R 



CTTTCACTTGAGCTTCAGGAT 
F H X« S F R 



TTT CACTTGAGCTTCAGGATG 
F H I* S F R M 



TTCACTTGAGCTTCAGGATGT 
F H I* S F R M 



T CACTTGAGCTTCAGGATGTT 
H L S F R M 



Fig. 20 (cont'd 1) 



T2 

TAAAAGGTAAAAATGAAAAAC 
AAAAGGT AAAAAT G AAAAAC A 
AAAGGTAAAAATGAAAAACAA 
AAGGTAAAAATGAAAAACAAA 
AGGT AAAAAT G AAAAACAAAA 
GGT AAAAAT GAAAAACAAAAA 
GT AAAAAT G AAAAAC AAAAAC 
T AAAAAT G AAAAAC AAAAAC A 
AAAAAT G AAAAAC AAAAACAA 
AAAAT GAAAAAC AAAAAC AAG 
AAATGAAAAACAAAAACAAGC 
AATGAAAAACAAAAACAAGCA 
ATGAAAAACAAAAACAAGCAA 
T GAAAAACAAAAACAAGC AAA 
GAAAAACAAAAACAAGCAAAC 
AAAAACAAAAACAAGCAAACA 
AAAACAAAAACAAGCAAACAA 
AAACAAAAACAAGCAAACAAA 
AACAAAAACAAGCAAACAAAC 
ACAAAAACAAGCAAACAAACA 
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T2 



-*CT CT AATTT AACAAT GCAT AAGAGT CAATAAAC C CT ACTTTTTTAAAAAAAA 
CCTCTAATTTAACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
CT CT AATTTAACAATGCATAAGAGT CAATAAACCCT ACTTTTTTAAAAAAAA 
T CTAATTTAACAATGCAT AAG AGTCAATAAAC CCT ACTTTTTTAAAAAAAA 
CTAATTT AAC AATGCATAAGAGT CAATAAAC CCT ACTTTTTTAAAAAAAA 
TAATTTAACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
AATTTAACAATGCATAAGAGT CAATAAACCCT ACTTTTTTAAAAAAAA 
ATTTAACAATGCAT AAGAGT CAATAAACCCT ACTTTTTTAAAAAAAA 
TTTAAGAATGCATAAGAGTCAATAAACCCTACTTTTTT AAAAAAAA 
T T AACAAT GCAT AAGAGT CAATAAAC CCT ACTTTTTTAAAAAAAA 
T AAC AATGCATAAGAGT CAATAAAC CCT ACTTTTTTAAAAAAAA 
AACAATGCATAAGAGT CAATAAAC CCT ACTTTTTTAAAAAAAA 
ACAATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
CAAT GC AT AAGAGTCAATAAACCCT ACTTTTTTAAAAAAAA 
AATGCATAAGAGTCAATAAACCCTACTTTTTTAAAAAAAA 
ATGCAT AAGAGT CAATAAACCCT ACTTTTTTAAAAAAAA 
TGCAT AAGAGT CAATAAACCCT ACTTTTTTAAAAAAAA 
GCAT AAGAGT CAAT AAACCCTACTTT TTT AAAAAAAA 
CATAAGAGT CAAT AAAC CCT ACTTTTTTAAAAAAAA 
AT AAGAGT CAATAAAC CCT ACTTTTTTAAAAAAAA 
T AAGAGT CAAT AAACCCTACTTTTTT AAAAAAAA 



Fig. 20 



(cont'd 2) 



T3 



IT3 



100/124 



ACTGGGCCAAT CATT AC CT AG 
W A N H Y L 



CTGGGCCAATCATTACCTAGC 
W A N H Y L 



TGGGC CAAT CATT AC CT AGCC 
W A N H Y L A 



GGGCCAATCATTACCTAGCCA 
W A N H Y L A 



GGCCAATCATTACCTAGCCAA 
A N H Y L A 



GCCAATC ATT AC CT AGC CAAA 
A N H Y L A K 



CGGCCACAAGCGT CT CAT CAG 
G H K R L I 

GGCCACAAGCGTCTCATCAGG 
G H K R L I R 

GCCACAAGCGTCTCATCAGGG 
G H K R L I R 

C C AC AAGCGTCT CAT CAGGGA 
H K R L I R 

C AC AAGCGT CT CAT CAGGG AT 
H K R L I R D 

ACAAGCGTCTCATCAGGGATC 
H K R L I R D 



CTGAAATGCAGTCCAGACTTC 
E M Q S R L 

TGAAATGCAGTCCAGACTTCC 
E M Q S R L 

GAAATGCAGTCCAGACTTCCA 
E M Q S R L P 

AAATGCAGTCCAGACTTCCAG 
E M Q S R L P 

AAT GC AGT CCAG ACTT C CAGG 
M Q S R** L P 



CCAAT CATT AC CT AGCC AAAT 
A N H Y L A K 



CAAGCGT CT CATC AGGGAT CT 
K R L ; I R D 



ATGCAGT CCAGACTT C CAGGT 
M Q S R L P G 



CAAT CAT T ACCTAGCC AAAT C 
N H Y L A K 



AATCATTAC CT AGCCAAATC C 
. N H Y L A K S 



ATCATTACCTAGCCAAATCCG 
N H Y L A K S 



TCATTACCTAGCCAAATCCGG 
H Y I* A K S 



CATTACCTAGCCAAATCCGGC 
H Y I» A K S G 



ATTACCTAGCCAAATCCGGCC 
H Y L A K S G 



AAGCGT CT CAT CAGGG AT CT C 
K R L I R D L 

AGCGTCTCATCAGGGATCTCC 
K R L I R D L 

GCGTCTCATCAGGGATCTCCA 
R L I R D L 

CGT CT CAT CAGGGATCTCCAG 
R L I R D L Q 

GT CT CAT CAGGGATCTC CAGC 
R L I R D L Q 

T CT CATCAGGGAT CT CCAGCA 
L I R D L Q 



T GC AGT CCAGACTT C CAGGT C 
M Q S R L P G 

GCAGT CC AGACTTCCAGGT CC 
Q S R L p G 

CAGTCCAGACTTCCAGGTCCT 
Q S R L P G P 

AGT CCAGACTTCCAGGT CCT A 
Q S R L P G P 

GTCCAGACTT CCAGGTCCT AC 
S R L P G P 



TTACCTAGCCAAATCCGGCCA 
Y L A K S G 



CT CAT CAGGG AT CT CC AG C AA 
L I R D L Q Q 



T C CAGACTTCCAGGT C CT ACC 
S R L P G P T 



TACCTAGCCAAATCCGGCCAC 
Y L A K S G H 

ACCTAGCCAAATCCGGCCACA 
Y L A K S G H 

C CTAGC CAAAT CCGGCCACAA 
, Tj A K S G H 

CT AGCCAAATC CGGCCACAAG 
L A K S G H K 

T AGC CAAAT CCGGCCACAAGC 
L A K S G H K 

AGCCAAATCCGGCCACAAGCG 
A K S G H K 

GCCAAATCCGGCCACAAGCGT 
A K S G H K R 



TCATCAGGGATCTCCAGCAAG 
L X R D It Q Q 

CAT C AGGGATCT CCAGCAAGA 
I R D L Q Q 

ATCAGGGATCTCCAGCAAGAT 
I R D L Q Q D 

TCAGGGATCTCCAGCAAGATG 
I R D I* Q Q D 

CAGGGATCT CCAGCAAGATGT 
R D L Q Q D 

AGGGAT CT CCAGCAAGATGTG 
R D L Q Q D V 

GGG AT CT CCAGCAAGATGTGA 
R D L Q Q D V 



CCAGACTT CCAGGT CCT ACCG 
S R L P G P T 

CAGACTTCCAGGTCCTACCGC 
R Ij P G P T 

AGACTT CCAGGT CCTACCGCG 
R L P G P T A 

GACTTC CAGGT C CT AC CGCGA 
R L P G P T A 

ACTT CCAGGTCCT ACCGCG AG 
I* P G P T A 

CTT CCAGGT CCT ACCGCGAGG 
L P G P T A R 



T T C C AGGTCCT AC CGCG AGGG 
Ij P G P T A R 



Fig. 20 (cont'd 3) 



T C CAGGT C CTACCGCGAGGGT 
P G P T A R 



C CAGGT CCT ACCGCGAGGGT A 
P G P T A R V 



T3 

CGGGGCAGTAGTGTCTGCCAC 
R G S S V C H 

GGGGCAGTAGTGTCTGCCACG 
R G S S V C H 

GGGCAGT AGT GT CT GCC ACGT 
G S S V C H 

GGCAGTAGTGTCTGCCACGTG 
G S S V C H V 

GC AGT AGT GT CT GCC ACGTGG 
G S S V C H V 

CAGTAGTGTCTGCCACGTGGA 
S S V C H V 

AGTAGTGTCTGCCACGTGGAC 
S S V C H V D 

GT AGTGT CTGCCACGT GGACG 
S S V C H V D 

xAGTGTCTGCCACGTGGACGT 
S V C H V D 

AGTGTCTGCCACGTGGACGTC 
S V C H V D V 

GTGTCTGCCACGTGGACGTCT 
S V C H V D V 

TGTCTGCCACGTGGACGTCTC 
V C H V D V 

GT CTGCCACGT GGAC GT CT CA 
V C H V D V S 

TCTGCCACGTGGACGTCT CAG 
V C H V D V S 

CTGCCACGTGGACGTCTCAGA 
C H V D V S 

TGCCACGTGGACGTCTCAGAC 
C H V D V S D 

GCCACGTGGACGTCTCAGACA 
C H V D V S D 

CCACGTGGACGTCTCAGACAA • 
H V D V S D 



T3 
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CACCATGC C AAGG ACGAAGG 
T M P R T K 



CAC CAT GCC AAGG ACGAAGGC 
T M P R T K 

AC CATGCCAAGGACGAAGGCT 
T M P R T K A 

CCATGCCAAGGACGAAGGCTT 
T M P R T K A 

CATGCCAAGGACGAAGGCTTC 
M P R T K A 

ATGCCAAGGACGAAGGCTTCA 
M P R T K A S 

T GCCAAGGACGAAGGCTTC AG 
M P R T K A S 

GC C AAGGACGAAGGCTT C AGC 
P R T K A S 

CC AAGGACGAAGGCTT C AGC C 
P R T K A S A 

CAAGGACGAAGGCTTCAGCCC 
P R T K A S A 

AAGGACGAAGGCTTCAGC C CC 
R T K A S A 

AGGACGAAGGCTTCAGCCCCG 
R T K A S A P 

GGACGAAGGCTT CAGCC C CGG 
R T K A S A P 

GACGAAGGCTT CAGCCCCGGC 
T K A S A P 

ACGAAGGCTTCAGCCCCGGCA 
T K A S A P A 

CGAAGGCTT CAGCC CCGGCAG 
T K A S A P A 

GAAGGCTTCAGCCCCGGCAGG 
K A S A P A 

AAGGCTTCAGCCCCGGCAGGC 
K A S A P A G 

AGGCTTCAGCCCCGGCAGGCG 
K A S A P A G 

GGCTTCAGCCCCGGCAGGCGC 
A S A P A G 

GCTTCAGCCCCGGCAGGCGCA 
A S A P A G A 



1 "3 «8 J. H-5 "••!!•• "3 ,.#9 : / S f W ! 5 4 9 



G AAGC AGAGTGGTT C CGCCA 
K Q S G S A 

GAAGCAGAGTGGTTCCGCCAC 
K Q S G*" S A 

AAGCAGAGTGGTTCCGCCACC 
K Q S G S A T 

AGCAGAGTGGTTCCGCCACCG 
K Q S G S A T 

GC AG AGTGGTT C CGCC AC CGG 
Q S G S A T 

CAGAGTGGTTCCGCCACCGGC 
Q S G S "A T G 

AGAGTGGTTCCGCCACCGGCC 
Q S G S A T G 

GAGTGGTTCCGCCACCGGCCT 
S G S A T G 

AGTGGTT CCGCCACCGGCCTG 
S G S A T G L 

GTGGTTCCGCCACCGGCCTGG 
S G S A T G L 

TGGTT CCGCCACCGGCCTGGC 
G S A T O L 

GGTTCCGCCACCGGCCTGGCC 
G S A T G L A 

GTTCCGCCACCGGCCTGGCCA 
G S A T G L A 

TTCCGCCACCGGCCTGGCCAT 
S A T G L A 

TTCCGCCACCGGCCTGGCCAT 
S A T G It A 

TCCGCCACCGGCCTGGCCATG 
S A T G L A M 

CCGCCACCGGCCTGGCCATGA 
S A T G I* A M 

CGCCACCGGCCTGGCCATGAT 
A T G L A M 

GCCACCGGCCTGGCCATGATC 
A T G L A M X 

CCACCGGCCTGGCCATGATCA 
T G r A M I 

CACCGGCCTGGCCATGATCAC 
T G L A M I 



CACGTGGACGTCTCAGACAAG 
H V D V S D K 



ACCGGCCTGGCCATGATCACA 
T G L A M I T 



ACGT GGACGT CT CAGACAAGG 
H V D V S D K 



CGTGGACGTCTCAGACAAGGC 
V D V S D K 



Fig. 20 (cont'd 4) 



T3 - ( 

GGTCTGGTCAACCAAACAGAC 
G L V N Q T D 

GTCTGGTCAACCAAACAGACA 
G L V N Q T D 

T CTGGT C AAC C AAAC AGAC AA 
L V N Q T D 

CTGGTCAACCAAACAGACAAG 
L V N Q T D K 

TGGT CAACC AAAC AGAC AAGG 
L V N Q T D K 

GGTCAAC C AAAC AGACAAGG A 
V N Q T D K 

GT GAACCAAAC AGACAAGGAG 
V N Q T D K E 

TCAACCAAACAGACAAGGAGA 
V N Q T D K E 

CAAC CAAACAG ACAAGGAGAA 
N Q T D K E 

.CCAAACAGACAAGGAGAAA 
N Q T D K E K 

ACCAAACAGACAAGGAGAAAG 
N Q T D K E K 

CCAAACAGACAAGGAGAAAGG 
Q T D K E K 

CAAACAGACAAGGAGAAAGGC 
Q T D K E K G 

AAACAGACAAGGAGAAAGGCA 
Q T D K E K G 

AACAGACAAGGAGAAAGGCAT 
T D K E K G 

ACAGACAAGGAGAAAGGC AT C 
T D K E K G I 

CAGACAAGGAGAAAGGCATCT 
T D K E K G I 

, AC AAGGAGAAAGGCAT CTC 
D K E K G I 

GACAAGGAGAAAGGCATCTCA 
D K E K G I S 



T3 
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TT C AT GG AT CCT CACTCTCCT 
H G S S L S 

TCATGGATCCTCACTCTCCTT 
H G S S L S 

CAT GG AT C CT C ACT CT CCTTG 
H G S S L S L 

ATGGATCCTCACTCTCCTTGG 
H G S S L S L 

TGGATCCTCACTCTCCTTGGT 
G S S L S L 

GGATCCT CACT CT CCTTGGTT 
G S S L S L V 

GAT C CTCACTCT C CTTGGTTT 
G S S L S L V 

ATCCTCACTCTCCTTGGTTTC 
S S L S L V 

TCCTCACTCTCCTTGGTTTCC 
S S L S L V S 

CCT CACT CTCCTTGGTTTCC A 
S S L S L V S 

CTCACTCTCCTTGGTTTCCAG 
S L S L V S 

TCACTCTCCTTGGTTTCCAGC 
S L S L V S S 

CACTCTCCTTGGTTTCCAGCA 
S L S I* V S S 

ACTCTCCTTGGTTTCCAGCAC 
Ij s l V S S 

CTCTCCTTGGTTTCCAGCACA 
L, S L V S S T 

TCTCCTTGGTTTCCAGCACAT 
L S Ii V S S T 

CTCCTTGGTTTCCAGCACATC 
S L V S S T 

TCCTTGGTTTCCAGCACATCG 
S L V S S T S 

C CTTGGTTT CCAGCACATCGT 
S L V S S T S 



^^^CCTTGGTTTCCAGCACATC 

S L V S S T 

TCCTTGGTTTCCAGCACATCG 
S L V S **"S T S 

CCTTGGTT T CCAGCACATCGT 
S Ii V S S T S 

CTTGGTTT CCAGCACATCGT C 
L V S S T S 

TTGGTTTCCAGCACATCGTCA 
L V S S T S S 

TGGTTTC CAGC AC AT CGTCAG 
L V S S T S S 

GGTTT CCAGCAC AT CGT CAGT 
V S S T S S 

GTTT CC AGC ACAT CGT CAGTT 
V S S T S S V 

TTTCCAGCACATCGT CAGTTT 
V S S T S S V 

TT CCAGCACAT CGT CAGTTTA 
S S T S S V 

T CCAGCAC ATCGTCAGTTT AT 
S S T S S V Y 

CCAGCACAT CGT CAGTTT ATT 
S S T S S V Y 

CAGCACAT CGT CAGTTT ATT C 
S T S S V Y 

AGCACAT CGT CAGTTT ATT CT 
S T S S V Y S 

GCACAT CGT CAGTTT ATT CT A 
S T S S V Y S 

CACATCGTCAGTTTATTCTAC 
T S S V Y S 

ACATCGTC AGTTTATT CT ACA 
T S S V Y S T 

CAT CGT CAGTTT ATT CTACAC 
T S S V Y S T 

AT CGT CAGTTT ATT CTACAC C 
S S V Y S T 



ACAAGGAGAAAGGCATCTCAT 
D K E K G I S 



CTTGGTTT CCAGCACATCGTC 
L V S S T S 



TCGT CAGTTT ATTCTACACCA 
S S V- Y S T P 



CAAGGAGAAAGGCATCTCATC 
K E K G I S 



TTGGTTTCCAGCACATCGTCA 
L V S S T S S 



CGT CAGTTT ATT CTACAC C AG 
S S V Y S T P 



T3 



Fig. 20 (cont'd 5) 



GGAAGAACT GGGT CAAT G AGTT ACGC AGCT C C 
KNWVNELRSS 



J 



09 91<4-SM-9 




it r 





AAGAAGAAGAGGAAGAACT GGGTCAAT GAGTT ACG< 



ITCCTTCAAG 
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KKKRKNWVNE LRS SFK 
T3 

T CT CT AATTT CAGAATGC ATGGAT A 
T3 

AGGAGATGAAGCTGACAG AT AT CCGCTT AGAAGCT CT 
T3 

GATTCCAGACCACACGTCTTTCTTATCG 



Fig- 20 (cont'd 6) 
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Alignment ^^ the T P rotein f U 7 / 7 l*f 3f 7 

Note: The N-terminus of protein T2 was omitted in the alignment, since it 
has no significant homology to the T protein and the T3 protein. 



T MDLSSEMNRHGKNPVSHKLEDQK KIYTDWANHYLAKSGHKRLIKDLQ 60 

T3 NQPERLNSQVLQGLQEPAGEGLPLRKSGSVENGFDTQIYTDWANHYLAKSGHKRLIRDLQ 

T2 

* * * * * ********************** 

T QDIADGVLLAEIIQIIANEKVE 120 

T3 QDVTDGVLLAQI IQWANEKIEDINGCPKNRSQMIENIDACLNFLAAKGINIQGLSAEEI 

T2 

* * ****** s *** a .****.*******. ^ ****** ^ * ** **** > * - * i ******** 

T RNGNLKAILGLFFSLSRYKQQQ-HHQQQYYQS LVELQQRVT 180 

T3 RNGNLKAILGLFFSLSRYKQQQQQPQKQHLSSPLPPAVSQVAGAPSQCQAGTPQQQVPVT 

T2 

********************** . *,* * . * ** 

T HASP PSEASQAKTQQDMQSRLPGP - SRVPAAGS SSKVQGASNLN- - RRSQS FNS I 240 

T3 PQAPCQPHQPAPHQQSKAQAEMQSRLPGPTARVSAAGSEAKTRGGSTTANNRRSQSFNNY 

T2 

* * * *^* ******** m ** **** a * ,* *. ******* 

T DKNK PP-- 300 

T3 DKSKPVTSPPPPPSSHEKEPLASSASSHPGMSDNAPASLESGSSSTPTNCSTSSAIPQPG 

T2 

* * * * * 

T N - 360 

T3 AATKPWRSKSLSVKHSATVSMLSVKPPGPEAPRPTPEAMKPAPNNQKSMLEKLKLFNSKG 
T2 



T 420 

T3 GSKAGEGPGSRDTSCERLETLPSFEESEELEAASRMLTTVGPASSSPKIALKGIAQRTFS 
T2 



T --YAN GNEK 480 

T3 RALTNKKSSLKGNEKEKEKQQREKDK^ 

T2 

* * * * * 

T 540 

T3 PKKS S KI AS F I PKGGKLNSAKKE PMAPSHSG I PKPGMKSMPGKS PSAPAPS KEGERSRSG 

T2 



T 600 

T3 KLSSGLPQQKPQLDGRHSSSSSSliASSEGKGPGGTTLNHSISSQTVSGSVGTTQTTGSNT 
T2 



Fig. 21 



T3 VSVQLPQPQQQYNHPNT. 
T2 
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PFLYRSQTDTEGNVTAE S S STGVSVE 



099 :l«»li-5^gi ,» tfK 



660 



rTKTGQPALE 



X GEDPETRRMRTVKNIADLRQNLEETMSSLRGTQISHSTLETTFDSTVTTEVNGRTIP 72 0 

T3 ELTGEDPEARRLRTVKNIADLRQNLEETMSSLRGTQVTHSTLETTFDTNVTTEMSGRSIL 

X2 DPESQRKRTVQMVLDLRQNLEETMSSLRGSQVTHSSLEMTCYD - - SDDANPRSVS 

*****_* ***.*. ***************.*..**.** * . . *.. 



X NLTSRPTPMTWRLGQACPRLQAGDAPSLGAGYP-RSGTSRFIHTDPSRFMYTTPLRRAAV 78 0 

X3 SLTGRPTPLSWRLGQS S PRLQAGDAPSMGNGYPPRANASRF INTE SGRYVYSAPLRRQLA 

X2 SLSNRSYPLSWRYGQSSPRLQAGDAPSVGGSCRSEGTPAWYMHGERAHYSHTMPMR--SP 

*. * *..** **..**********.* *.* 

X S RLGNM S Q I DM S E KA - S S DLDM S - S E VDVGG YM S DGD I L GKS LRT DD I NS G YMTDGGLNL 840 

X3 SRGSSVCHVDVSDKA-GDEMDLEGISMDAPGYMSDGDVLSKNIRTDDITSGYMTDGGLGL 

X2 SKLSHISRLELVESLDSDEVDLK SGYMSDSDLMGKTMTEDDD ITTG 



X YTRSLNRI PD - TATSRDI I QRGVHD VTVD AD S WDD S S SVS SGLSDTLDNI STDDLNTTS S 900 
T3 YTRRLiNRLPDGMAWRETLQRNTSLGLGDADSWDDSSSVSSGISDTIDNLSTDDINTSSS 
T 2 WDESSS I S SGLSDAS DNL, S SEEFNAS S S 

* * -k * ^ * * 



X VSSYSNITVPSRKN- - TQLRTDSEKRSTTDET - -WDSP- -EELKKPE- -EDFDSHGDAG- 960 

T3 ISSYANTPASSRKN- -LDVQTDAEKHSQVERNSLWSG DDVKKSDGGSDSGI KME PG - 

T2 LNSLPSTPTASRRNSTIVLRTDSEKRSLAESGLSWFSESEEKAPKKLEYDSGSLKMEPGT 
* ..**.**.* * * . . * 

T GKWKTVSSGLPEDPEK- AGQKASLSVSQTGSWRRGMSAQGGAPS RQKAGTSALKTP - 102 0 

T3 SK^RRNPSDVSDESDKSTSGKKNPVISQTGSWRRGMTAQVGITMPRTKASAPAGALKTPG 
T2 SKWRRERPESCDDSSKGGELKKPISLGHPGSLKKGKTPPVAVTSP- - ITHTAQSALKVAG 
** . . . * * ..**..*. . *** 

T -GKTDDAKASEKGKAPLKGSSLQRSPSDAGKSSGDEGKK- -PPSGIGRSTATSSFGFKKP 1080 
T3 XGKTDDAKVSEKGRLSPKASQVKRSPSDAGRSSGDESKKPLPSSSRTPTANANSFGFKKQ 

T2 KPEGKATDKGKLAVKNTGLQRSSSDAGRDRLSDAKK- - PPSGIARPSTSGSFGYKKP 

★ * . . . ** ****. ** * * . ***.** 

X SG-VGSSAMITSSGATITSGSATLGKIPKSAAIGGKSNAGRKTSLDGSQNQDDWLHVSS 114 0 
T3 SGSATGLAMITASGVTVTSRSATLGKI PKSSALVSRS - AGRKSSMDGAQNQDDGYLALSS 

T2 PP-ATGTATVMQTG GSATLSKI QKSSGI PVKPVNGRKTSLDVSNSAE PGFLAPGA 

* 9 * **** ** **. . . * 

T KTTLQYRSLPRPSKSSTSGIPGR-GGHRSSTSSID- SNVSSKSAGATTSKLREPTKIGSG 12 00 
T3 RTNLQYRSLPRPSKSNSR- -NG- -AGNRSSTSSID- SNISSKSAGLPVPKLREPSKTALG 
T2 RSNIQYRSLPRPAKSSSMSVTGGRGGPRPVSSSIDPSLLSTKQGGLTPSRLKEPTKVASG 
b # ********. ** . * * * .**** * .*.* * .*.**.* * 

T RSSPVTVNQTDKEKEKVAVSDSESVSLSG-SPKSSPTSASACG-AQGLRQPGSKYPDIAS 1260 
T3 SSLPGLVNQTDKEKG- - ISSDNESVASCN-SVKVNPAAQPVSSPAQTSLQPGAKYPDVAS 

T2 RTTPAPVNQTDREKE KAKAKAVALDSDNISLKSIGSPESTPKNQASHPTATKLAELP 

* . . . .* 

T PXFRRLFGAKAGGKSASAPNTEGVKSSSVMPSPSTTIoARQGSLESPSSGTGSMGSAGGLS 13 2 0 

T3 PTLRRLFGGKP - TKQVPI ATAENMKNSWI SNPHATMTQQGNLDS PS - GSGVLS --S 

T2 PTPLRAT - AKS FVKP P SLANLDKVN - SNSLDLPS S SDTTHAS KVPDLHATS S AS 

* * * * * ...*.*.--. • 



Fig. 21 (cont'd 1) 
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GSSSPLFNKPSDLTTD^^B^HSIASSPASVHS 13 8 0 

GSSSPLYSKNVDLN QS PLAS S PS SAHS AP SNSLTWGTNAS S S SAVS KDGLGFQS 

--GGPLPS CFTPSPAPILNINSASFSQGLELMSGFSVPKETRMYPK 



MTSLHTSSESIDLPLS HHGSLSGLTTG THEVQSLLMRTGSVRSTLSES - - 1440 

VSSLHTSCESIDISLSSGGVPSHNSSTGLIASS KDDSLTPFVRTNSVKTTLSESPL 

LSGLHRSMESLQMPMSLPSAFPSSTPVPTPPAPPAAPTEEETEELTWSGSPRAGQLDS - - 
^ ***** ^ * . . ..*.. .* 

MQLDRNTLPKKGLRYTPSSRQANQEEGKEWLRSHSTGGL 15 0 0 

S SPAAS PKFCRSTLPRKQDSDPHLDRNTLPKKGLRYTPTSQLRTQEDAKEWLRSHSAGGL 

NQRDRNTLPKKGLRYQLQS QEETKERRHSHTIGGL 

************ * **. ** .**. *** 

QDTGNQSPLVSPSAMSSSAAGKYHFSNLVSPTNLSQFNLPGPSMMRSNSIPAQDSSFDLY 1560 
QDTAANSPFSSGSSVTSPSGTRFNFSQLASPTTVTQMSLSNPTMLRTHSLSNADGQYDPY 

PESDDQSELPSPPALPMSLSAKGQLTNIVSPTAAT TPRITRSNSIPTHEAAFELY 

* * .... *** . *.*..*. . ..* 

DDSQLCGSATSLEERPRAISHSGSFRDSMEEVHGSSLSLVSSTSSLYSTAEEKAHSEQIH 162 0 
TDSRFRNSSMSLDEKSRTMSRSGSFRDGFEEVHGSSLSLVSSTSSVYSTPEEKCQSE-IR 
SGSQMG-STLSLAERPKGMIRSGSFRDPTDDVHGSVLSLASSASSTYSSAEERMQSEQIR 
* * ** * _ ****** **** *** **.** **. **. .** *. 

KLRRELVASQEKVATLTSQLSANAHLVAAFEKSLGNMTGRLQSLTMTAEQKESELIELRE 168 0 
KLRRELDASQEK^SALTTQLTANAHLVAAFEQSLGNMTIRLQSLTMTAEQKDSELNELRK 
KLRRELESSQEKVATLTSQLSANANLVAAFEQSLVNMTSRLRHLAETAEEKDTELLDLRE 
****** ,*****..**.**.***.******.** *** **. *. ***.*..** .** 

T I EMLKAQNS AAQAAI QGALNGPDHPPK DLRIRRQHSSESVSSINSATSHSS 174 0 

T I ELLKKQNAAAQAAINGVINTPELNCKGNGTAQS ADLRI RRQHS SDSVS S INSATSHS S 

T I D F LKKKNS E AQAV I QGALNAS ETT P K ELRIKRQNSSDSISSLNSITSHSS 

** ** * *** * * * b * . *** . **.**.*.**. ** ***** 

I G S GND AD S KKKKKKNW L RSSFKQAFGKKKSTKPPSSHSDIEELT- -DSSLPASPKL 1800 

VGSNIESDSKKKKRKNWVNELRSSFKQAFGKKKSPKSASSHSDIEEMT- -DSSLPSSPKL 

IGSSKDADAKKKKKKSWL RS S FNKAFS I KKGPKSAS S YSD I EE I ATPDS SAPS S PKL 

** * ***** * **** ,** ** * ** *****.. *** *.**** 

PHNAGDCGSASMKPSQSASAI CECTEA E AE 1 1 LQLKS ELRE 1860 

PHNGSTGSTPLLRNSHSNSLI SECMDS EAETVMQLRNELRD 

QHGSTETASPSIKSSTSSSVGTDVTEGPAHPAPHTRLFHANEEEEPEKKEVSELRSELWE 
* * * * *..*. ** 

KELKLTD I RLEALS S AHHLDQ I REAMNRMQNE I E I LKAENDRLKAETGNTAKPTRP P S ES 1920 
KEMKLTDIRLEALSSAHQLDQLREAMNRMQSEIEKLKAENXJRLKSES -QGSGCSRAPSQV 
KEMKLTDIRLEALNSAHQLDQLRETMHls^QLEVDLLEAENDRLKVAP--GPSSGSTPGQV 
**.********** ***.***.*★.*. ** *.. * ******* * . 

SSSTSSSSSRQSLGLSLNNLNITEAVSSDILLDDAGDATGHKDG-RSVKIIVSISKGYGR 198 0 

SISAS PRQSMGLSQHSLNLTESTSLDMLLDDTGECSARKEGGRHVKIWSFQEEMKW 

PGSSALSSPRRSLGLALTHSFGPSLADTDLSPMDGISTCGPKEE-VTLRVVVRMPPQHII 
* **** *. * 
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T AKI)QKSQAYLIGSIGVS|^KWDVLDGVIRRLFKEYVFRIDTSTSLG^fcpCIASYCIGD 2 04 0 

T3 kedsrphlfligcigvsI^kwdvldgwrrlfkeyiihvdpvsqlgSIRsvlgysige 

T2 kgdlkqqefflgcskvsgkvdwkmldeavfqvfkdyiskmdpastlglstesihgysish 

* . . . . * . **** * * * m t * * * -k ^ * * * .... * * 

T lirshnlevpellpcgylvgdnniitvnlkgveensldsfvfdtlipkpitqryfnllme 2100 
T3 ikrsntsetpellpcgylvgenttisvtvkglaensldslvfeslipkpilqryvsllie 
T2 vkrvldaeppemppcrrgvnn — isvslkglkekcvdslvfetlipkpmmqhyislllk 

* * **_ * * * _ * * . * * . * ..** ** * * . 

T HHRI I LSGP SGTGKTYLANKLAE YVI TKSGRKKTEDA I ATFNVDHKS S KELQQYLANLAE 2160 
T3 HRRIILSGPSGTGKTYLANRLSEYIVLREGRELTDGVIATFNVDHKSSKELRQYLSNLAD 
T2 HRRLVLSGPSGTGKTYLTNRLAEYLVERSGREVTEGIVSTFNMHQQSCKDLQLYLSNLAN 

T QCSADNNGVELPWIILDNLHHVGSLSDIFNGFLNCKYNKCPYIIGTMNQGVSSSPNLEL 2220 
T3 QCNSENNAVDMPLVI ILDNLHHVSSLGEI FNGLLNCKYHKCPYI-IGTMNQATSSTPNLQL 
T2 QIDRETGIGDVPLVILLDDLSEAGSISELVNGALTCKYHKCPYIIGTTNQPVKMTPNHGF 

T HHNFRWLCANHTEPVKGFLGRYLRRKLIEIEIERNIRNNDLVKIIDWIPKTWHHLNSFL 2280 
T3 HHNFRWVLCANHTEPVKGFLGRFLRRKLMETEISGRVRNMELVKI IDWI PKVWHHLNRFL 
T2 HLSFRMLTFSNWEPANGFLVRYLRRKLVESDSDINANKEELLRVLDWPKLWYHLHTFL 

T ETHSSSDVTIGPRLFLPCPMDVEGSRVWFMDLWNYSLVPYILEAVREGLQMYGKRTPWED 23 40 
T3 EAHSSSDVTIGPRLFLSCPIDVDGSRVWFTDLWNYS 1 1 PYLLEAVREGLQLYGRRAPWED 

T2 EKHSTSDFLIGPCFFLSCPIGIEDFRTWFIDLWl^NSIIPYLQEGAKDGIKVHGQKAAWED 

★ **** *** * * * * ^ # ★ * * * * * * * ^ * * ^ * * * * 

T PSKV7VLDTYPWSSATLPQESPALLQLRPEDVGYESCTSTKEATTSKHIPQTDTEGDPLMN 24 00 
T3 PAKWVMDTYPWAASPQQHEWPPLLQLRPEDVGFDGYSMPREGSTSKQMPPSDAEGDPLMN 

T2 PVEWWDTLPWPSAQQDQSK--LYHLPPPTVGPHSIASPPEDRTVKDSTPSSIjDSDPLMA 

★ ** ** ** _ 9 * ^* * ** a * * * ^ **** 

T MLMKLQEAANYSSTQSCDSES- -TSHHEDILDSSLESTL 
T3 MLMRLQEAANYSSPQSYDSDSNSNSHHDDILDSSLESTL 
T2 MLLKLQEAANYIE- - SPDRET ILDPNLQATL 
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Alignment of the T protein with the P0M121 protein 

10 20 30 40 SO -60 

T- Protein MDLSSEMNRHGKNPVSHKLEDQKK^ 

P0M121 



T-Protein 

POM121 



QIIAIIEKVEDINGCPRSQSQ^IENVDV 

MS P AAAAADGGERRRP PLGVREGRGR-TRGCGGPAGAAAIiGLALLGLAL 



T- Protein 
P0M121 



SLSRYKQQQHHQQQYYQSLVEIiQQRVTHASPPSEASQAKTQQDMQSRLPGPSRVPAAGSS 
YLV -p AAAALAWLAVG AS AAWWGLS RE PRG P - - 



T-Protein 
H121 



S KVQGASNIiNRRS QS FNS ID KNKP PNYANGNEKG ED PETRRMRTVKHIADIiRQNIjBETMS 
RGLSSFVRBSR RHPRP ALTAS PLPAKS P VNGSLCEPRS 



T-Procein 
POM121 



T-Protein 
POM121 



T-Protein 
P0M121 



SIiRGTQISHSTLETTPDSTVTTEVNGRTIPNLTSRPTPMTWRLGQACPRLt^GDAPSLGA 
PLGGPDPAELLLMGSYLG KPGPPEPALPQD-PRDRPGRRPPSRS 



GYPRSGTSRFIHTDPSRFMYTTPLRRAAVSRLGNMSQID^ 

PPSSSTAQRVHHVYP ALPTPLLRPSRR PPHRDCGPLS 



-*• * * * 



S DGD ILGKSLRTDD IKS GYMTDGGLNL YTRSLNRI PDTATSRD I 1 QRGVHDVTVDAD S WD 
S RFVITPR - RRYP IQQAQYS LLGALP WCWITGGHKKAVLS ARNS - RMVCS PVTVRIAPPD 



* * * 



*k Protein 
POM121 



DSSSVSSGLSDTLDttlSTDDLm^SSVSSYSN^ 
S;KXjFRSPMPEQILSTTIjSSP$SNAPDPCAKET^ 



„*. * ** * w * * * + * *■ 



T-Protein 
POM121 



EELKKPEEDFDSHGDAGGXWKTVSSGLPEDPEKAGQKASLSV^ 

LHLDGQENKRRRHDSSG SGHSAFE PLVANGVPAAFVP KPGSLi KRS LAS QS SDDH 



T-Protein 
POM121 



RQK-AGTSALKTPGKTDDAKASEKGK-APLKGSSrjQRSPSDAGKSSGDEGKKPPSGIGRS 
LNKRSRTSSVSSLTSTGTGGIPSSSRNAITSSYSSTRGVSQLWKRSG-PTSSPFSSPASS 



T-Protein 
POM121 



TATSSFGFK3CPSGVGSSAMITSSGATITSGSATLGKIPKSAAIGGKSNAGRKTSLDGSQN 
RSQTPERPAKKTREEE PCHQS SSS APLVTDKES PGEKVTDPATGKQQSLWTS PPTPGS SG 



T-Protein 

POM121 



QDDVVLHVSSKTTLQYRSLPRPSKSSTSGIPGRGGHRSSTSSIDSNVSSKSAGATTSKLR 
QRKRKIQLLPSRRGDQLTLPPP P--ELG- - YSITAEDLDMERR AS LQ 



Fig. 22 



i 



0*3*31 -nm-sNMai 



T-Protein. 
POM121 
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EPTKIGSGRSSPVTWQTDKEKEKVAVSDSESVSIiSGSPKSSPTSASACGAQGIiRQPGSK 
WFNKVLEDKTDDASTPATDTSP ATSPPFTLTL P TVGPAASPASLPAPSS - 



T-Protein 
POM121 



YPDIASPTFRRLFGAKAGGKSASAPNTEGVKSSSVMPSPSTTI^ 

1^LLESLKKMQESPAPSSSEPPB--AATVAAPSPPKTPSLLAPLVSP 



*** * 



T-Procein 

POM121 



T-Protein 
POM121 



SAGGIjSGSSSPLFNKPSDLTTDVISLSHSLASSPASVHSFTSGGLVWAANMSSSSAGSro 
LTG PL AS TS S DS KPTTTFLGLAS AS S ATP LTDTKAP GVS Q AQLCVSTP AATAP 



* * * ** 



TPSYQSMTSIMTSSESIDLPLSHHGSLSGLTTGTHEVQSLX^TGSWSTLSESMQLDRN 
SP TP ASTLFGMLSPPASSSS IATPGPACAS PMFKP I FPATPKSE SDN 



** * * * 



T-Protein TLPKKGLRYTP S SRQANQEEGKEWLRSHSTGGIiQDTGNQS PLVS P SAMS S SAAGKYHFSN 
0M121 PLP TS S S AATTTP AS TAL PTTATAT AHTFKP I FES VE P FAAMP - - 



T-Protein 
POM121 



LVSPTNLSQF^PGPSKn^SNSIPAQDSSFDLYDDSQL^ 

L3PPFSLKQTTAPATTAATSAPLLTG L GTATST VATGTTAS 



* * * * -k 



T-Protein S MEEVHGSSLSL VS S TS SLYS TAEEKAHSEQ IHKLRRELVAS QEKVATLTSQIiSANAHLV 
POM12 1 AS KPVFG FGVTTAAS TASTXAS TS QS ILFGGAPPVTAS S SAP ALAS IFQFGKPLA 



T-Protein AAFEKSLGNMTGRLQSLTMTAEQKESELIEIiRBTIEMLKAQNSAAQAAIQGALNGPDHPP 
POM12 1 PAAS VAGTS FSQS LAS SAQTAASNS S - -GGFSGFGGTLTTSTSAPATTSQPTLTFSNTVT 



-Protein 
>OM121 



KDLRIRRQHSSE -S VSS INSATSHS SIGSGOT>ADSKKKKKKNWLRSSFKQAFGKKKSTK- 
PTPNIPFSASAKPALPTYPGANSQPTFG-ATDGATKP ALAPSFGSSFTFGNSVAS 



* * * * * 



* ** * + 



T-Protein 
POM121 



PPSSHSDIEELTDSSLPASPKLPHl^GDCGSASMKPSQ 

APSAAPAPAAFGGAAQPAFGGLKASASTFG TPASTQPAFGSTTS VFSFGSA 



r* * * * 



T-Protein 
POM121 



LREKELKLTD I RLEALS S AHHLDQI REAMNRMQNE I EILKAENDRLKAETGNTAKPTRPP 
TTS - GFGAAAATTQTTHSGS - -- SSSLFGSSTPS - PF 



T-Protein 
POM121 



SBSSS STSSSSSRQSLGLSLNNLNITBAVSSDIIiLDDAGDATGH 

TFGGSAAPAGGG GFGLS ATPGTGSTSGTFSFGSGQS GT TGTTTSFGGSLSQUT 

*- . - * * *.* * * * * 



T- Protein 
POM121 



GRAKDQKSQAYIjIGSIGVSGKTKWDVIjDGVIRRIjFKEYVTRIDTSTSLGLSS 

LGAPSQSS- -PFAFSVGSTPESKP VFGGTSTPTFGQS APAPG - - -V 

*** *_*.«• . , 



Fig. 22 (cont'd 1) 



T- Protein 
POM121 



O «3 9i .:! «4 i5 «+ «3 ,„ O 6 O S O iP 
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GTTGS SLS FGAPS TPAQGFVG - - - VGPFGSGAPSFSIGAGSKTPGARQRLQAR 




TrProtein MEHHRIILSGPSGTGKTYLAl^KLAEYVITKSG^ 

POM12 1 RQHTRKK 



T-Protein 

POMX21 



AEQCS ADNNGVELPWI ILDNLHHYGSIiSD I FNGFLNCKYNKCPYI IGTMNQGVS S S PNL 



T-Proteiiz 
POM121 



EliHHNFRWVXiCANHTEPVKGFLGRYLRRKLIE IEI ERNIRNNDLVKI IDWI PKTWHHLNS 



T-Protein FLETHSSSDVTIGPRLFLPCPMDVEGSRVWFMDLWIT^SLVPYILEAVi^GLQMyGKRTPW 
^*>M121 



T- Proce in EDPSKWVIiDTYPWSSATLPQESPALLQLRP]^ 

P0M121 . 



T-Protein MNMLMKLQEAANYSSTQSCDSESTSHHEDILDSSLESTL 
POM121 



Fig, 22 (cont'd 2) 
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Expression of the T gene family. 

a fetal tissue: left : T gene; middle : T2 gene; right : T3 gene. 

He = heart; Br = brain; Lu = lungs; Li = liver; Ki = kidney 
b adult tissue: left : T gene; middle : T2 gene; right : T3 gene. 

He = heart; Br = brain; PI = placenta; Lu = lungs; Li - liver; Mu = 

skeletal muscle; Ki = kidney; Pa = pancreas 
c adult brain regions: left : T gene; middle : T2 gene; right : T3 gene. 

Cer = cerebellum; Cor = cerebral cortex; Med = medulla; Sco = spinal 

cord; Opo = occipital pole; Flo = frontal lobe; Tlo = temporal lobe; 

Put = putamen 2. 

=• — 

Fig. 23 
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Figure legend of immunohisto and electron microscopy: 

a == brain stem. CG central grey = central grey of the brain stem 

b = hippocampus, dg = dental gyrus; CA3 cornu ammonis 3, both subregions 
of the hippocampus formation 



c = 



electronmicroscopic picture. N - nucleus, He heterochromati^ 



Fig. 25 



09/914549 



116/124 






Sua 



' ' ' ' ' ' ' '^1 - , , , 1 j | 

i - m 1.500 

eta Position ' J r I 



l.fi 



■ III 



CO - 



I 

500 



I 



H 500 1,000 

Position 




domain 



